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EXPRESSION VECTORS FOR STIMULATING AN IMMUNE 
RESPONSE AND METHODS OF USING THE SAME 

CROSS-REFERENCES TO RELATED APPLICATIONS 
This application claims the benefit of 09/078,904, filed May 13, 1998, and 
60/085,751, filed May 15, 1998, both herein incorporated by reference in their entirety. 

STATEMENT AS TO RIGHTS TO INVENTIONS MADE UNDER 
. FEDERALLY SPONSORED RESEARCH AND DEVELOPMENT 

This invention was made with government support under NIH Grant No. 

. ^ ...vT- *T-,oco^ ni xTTtr ror,h-<,rt IsTn NOl - AT-45241 . The 

Ai-4^(5yy-Ul, iNLti uraiu inu. /u.jojo-t-uj, ohu ii^^^ ^-^ 

Government has certain rights in this invention. 

FIELD OF THE INVENTION 
The present invention relates to nucleic acid vaccines encoding multiple 
CTL and HTL epitopes and MHC targeting sequences. 

BACKGROUND OF THE INVENTION 
Vaccines are of fundamental importance in modem medicine and have 
been highly effective in combating certain human diseases. However, despite the 
successful implementation of vaccination programs that have greatly limited or virtually 
eliminated several debiUtating human diseases, there are a number of diseases that affect 
miUions worldwide for which effective vaccines have not been developed. 

Major advances in the field of immunology have led to a greater 
understanding of the mechanisms involved in the immune response and have provided 
insights into developing new vaccine strategies (Kuby, Immunology, 443-457 (3rd ed., 
1997), which is incorporated herein by reference). These new vaccine strategies have 
taken advantage of knowledge gained regarding the mechanisms by which foreign 
material, termed antigen, is recognized by the immune system and eliminated firom the 
organism. An effective vaccine is one that eUcits an immune response to an antigen of 
interest. 

SpeciaUzed ceUs of the immune system are responsible for the protective 
activity required to combat diseases. An immune response involves two major groups of 
cells, lymphocytes, or white blood cells, and antigen-presenting cells. The purpose of 
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these immune response cells is to recognize foreign material, such as an infectious 
organism or a cancer cell, and remove that foreign material from the organism. 

Two major types of lymphocytes mediate different aspects of the immune 
response. B cells display on their cell surface specialized proteins, called antibodies, that 

5 bind specifically to foreign material, called antigens. Effector B cells produce soluble 
forms of the antibody - which circulate throughout the body and function to eliminate 
antigen from the organism. This branch of the immune system is known as the humoral 
branch. Memory B cells function to recognize the antigen in fiiture encounters by 
continuing to express the membrane-bound form of the antibody. 

10 A second major type of lymphocyte is the T cell. T cells also have on their 

cell surface specialized proteins that recognize antigen but, in contrast to B cells, require 
that the antigen be bound to a specialized membrane protein complex, the major 
histocompatibility complex (MHC), on the surface of an antigen-presenting cell. Two 
major classes of T cells, termed helper T lymphocytes ("HTL") and cytotoxic T 

1 5 lymphocytes ("CTL"), are often distinguished based on the presence of either CD4 or 
CDS protein, respectively, on the cell surface. This branch of the immune system is 
known as the cell-mediated branch. 

The second major class of immune response cells are cells that function in 
antigen presentation by processing antigen for binding to MHC molecules expressed in 

20 the antigen presenting cells. The processed antigen bound to MHC molecules is 

transferred to the surface of the cell, where the antigen-MHC complex is available to bind 
to T cells. 

MHC molecules can be divided into MHC class I and class II molecules 
and are recognized by the two classes of T cells. Nearly all cells express MHC class I 

25 molecules, which function to present antigen to cytotoxic T lymphocytes. Cytotoxic T 
lymphocytes typically recognize antigen bound to MHC class I. A subset of cells called 
antigen-presenting cells express MHC class H molecules. Helper T lymphocytes 
typically recognize antigen bound to MHC class H molecules. Antigen-presenting cells 
include dendritic cells, macrophages, B cells, fibroblasts, glial cells, pancreatic beta cells, 

30 thymic epithelial cells, thyroid epithelial cells and vascular endothelial cells. These 

antigen-presenting cells generally express both MHC class I and class II molecules. Also, 
B cells function as both antibody-producing and antigen-presenting cells. 

Once a helper T ly-mphocite recognizes an antigen-MHC class U complex 
on the surface of an antigen-presenting cell, the helper T lymphocyte becomes activated 

- 2 - 
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and produces growth factors that activate a variety of cells involved in the immune 
response, including B cells and cytotoxic T lymphocytes. For example, under the 
influence of growth factors expressed by activated helper T lymphocytes, a cytotoxic T 
lymphocyte that recognizes an antigen-MHC class I complex becomes activated. CTLs 
5 monitor and eliminate cells that display antigen specifically recognized by the CTL, such 
as infected cells or tumor cells. Thus, activation of helper T lymphocytes stimulates the 
activation of both the humoral and cell-mediated branches of the immune system. 

An important aspect of the immune response, in particular as it relates to 
vaccine efficacy, is the manner in which antigen is processed so that it can be recognized 
10 by the specialized cells of the immune system. Distinct antigen processing and 

presentation pathways are utilized. Vac one is a cytosolic pathway, which results in the 
antigen being bound to MHC class I molecules. An alternative pathway is an 
endoplasmic reticulum pahtway, which bypasses the cytosol. Another is an endocylic 
pathway, which results in the antigen being bound to MHC class II molecules. Thus, the 
15 cell surface presentation of a particular antigen by a MHC class H or class I molecule to a 
helper T lymphocyte or a cytotoxic T lymphocyte, respectively, is dependent on the 
processing pathway for that antigen. 

The cytosolic pathway processes endogenous antigens that are expressed 
inside the cell. The antigen is degraded by a specialized protease complex in the cytosol 
20 of the cell, and the resulting antigen peptides are transported into the endoplasmic 
reticulum, an organelle that processes cell surface molecules. In the endoplasmic 
reticulum, the antigen peptides bind to MHC class I molecules, which are then 
transported to the cell surface for presentation to cytotoxic T lymphocytes of the immune 
system. 

25 Antigens that exist outside the cell are processed by the endocytic 

pathway. Such antigens are taken into the cell by endocytosis, which brings the antigens 
into specialized vesicles called endosomes and subsequently to specialized vesicles called 
lysosomes, where the antigen is degraded by proteases into antigen peptides that bind to 
MHC class n molecules. The antigen peptide-MHC class H molecule complex is then 

30 transported to the cell surface for presentation to helper T lymphocytes of the immune 
system. 

A variety of factors must be considered in the development of an effective 
vaccine. For example, the extent of activation of either the humoral or cell-mediated 
branch of the immune system can detennine the effectiveness of a vaccine against a 

- 3 - 
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particular disease. Furthermore, the development of iirununologic memory by inducing 
memory-cell formation can be important for an effective vaccine against a particular 
disease (Kuby, supra). For example, protection from infectious diseases caused by 
pathogens with short incubation periods, such as influenza virus, requires high levels of 

5 neutralizing antibody generated by the humoral branch because disease symptoms are 
already underway before memory cells are activated. Alternatively, protection from 
infectious diseases caused by pathogens with long incubation periods, such as polio virus, 
does not require neutralizing antibodies at the time of infection but instead requires 
memory B cells that can generate neutralizing antibodies to combat the pathogen before it 

1 0 is able to infect target tissues. Therefore, the effectiveness of a vaccine at preventing or 
ameliorating the sympiorns of a particular disease depends on the type of immune 
response generated by the vaccine. 

Many traditional vaccines have relied on intact pathogens such as 
attenuated or inactivated viruses or bacteria to elicit an immune response. However, 

1 5 these traditional vaccines have advantages and disadvantages, including reversion of an 
attenuated pathogen to a virulent form. The problem of reversion of an attenuated 
vaccine has been addressed by the use of molecules of the pathogen rather than the whole 
pathogen. For example, immunization approaches have begun to incorporate 
recombinant vector vaccines and synthetic peptide vaccines (Kuby, supra). Recently, 

20 DNA vaccines have also been used (Donnelly et al. Annu. Rev. Immunol. 15:61 7-648 
(1997), which is bcorporated herein by reference). The use of molecules of a pathogen 
provides safe vaccines that circumvent the potential for reversion to a virulent fomi of the 
vaccine. 

The targeting of antigens to MHC class 11 molecules to activate helper T 
25 lymphocytes has been described using lysosomal targeting sequences, which direct 

antigens to lysosomes, where the antigen is digested by lysosomal proteases into antigen 

peptides that bind to MHC class H molecules (U.S. Patent No. 5,633,234; Thomson et al. 

J. Virol. 72:2246-2252 (1998)). It would be advantageous to develop vaccines that 

deliver multiple antigens while exploiting the safety provided by administering individual 
30 epitopes of a pathogen rather than a whole organism. In particular, it would be 

advantageous to develop vaccines that effectively target antigens to MHC class II 

molecules for activation of helper T lymphocytes. 

Several studies also point to the crucial role of cytotoxic T cells in both 

production and eradication of infectious diseases and cancer by the immune system 
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(Byrne et al. J. Immunol. 51:682 (1984); McMichael et al., N. Engl. J. Med. 309:13 
(1983)). Recombinant protein vaccines do not reliably induce CTL responses, and the 
use of otherwise immunogenic vaccines consisting of attenuated pathogens in humans is 
hampered, in the case of several important diseases, by overriding safety concerns. In the 
5 case of diseases such as HIV, HBV, HCV, and malaria, it appears desirable not only to 
induce a vigorous CTL response, but also to focus the response against highly conserved 
epitopes in order to prevent escape by mutation and overcome variable vaccine efficacy 
against different isolates of the target pathogen. 

Induction of a broad response directed simultaneously against multiple 

10 epitopes also appears to be crucial for development of efficacious vaccines. HIV 
mfecuoii 15 perhaps the best exainple where aii infecicd host rnay benefit from a 
multispecific response. Rapid progression of HIV infection has been reported in cases 
where a narrowly focused CTL response is induced whereas nonprogressors tend to show 
a broader specificity of CTLs (Goulder et al. Nat. Med. 3:212 (1997); Borrow et al, Nat. 

15 Med. 3:205 (1997)). The highly variable nature of HIV CTL epitopes resulting from a 
highly mutating genome and selection by CTL responses directed against only a single or 
few epitopes also supports the need for broad epitope CTL responses (McMichael et al. 
Annu. Rev. Immunol. 15:271 (1997)). 

One potential approach to induce multispecific responses against 

20 conserved epitopes is immunization with a minigene plasmid encoding the epitopes in a 
string-of-beads fashion. Induction of CTL, HTL, and B cell responses in mice by 
minigene plasmids have been described by several laboratories using constructs encoding 
as many as 11 epitopes (An etal, J. Virol. 71:2292 (1997); Thomson et al, J. Immunol. 
157:822 (1996); Whition et al, J. Virol 67:348 (1993); Hanke etal. Vaccine 16:426 

25 (1998); Vitiello et al. Eur. J. Immunol. 27:671-678 (1997)). Minigenes have been 

delivered in vivo by infection with recombinant adenovirus or vaccinia, or by injection of 
purified DNA via the intramuscular or intradermal route (Thomson et al, J. Immunol 
160:1717 (1998); Toes et al. Proc. Natl Acad. ScL USA 94:14660 (1997)). 

Successfiil development of minigene DNA vaccines for human use will 

30 require addressing certain fundamental questions dealing with epitope MHC affinity, 
optimization of constructs for maximum in vivo immunogenicity, and development of 
assays for testing in vivo potency of multi-epitope minigene constructs. Regarding IVHC 
binding affinity of epitopes, it is not currently known whether both high and low affinity 
epitopes can be included within a single minigene construct, and what ranges of peptide 
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affinity are permissible for CTL induction in vivo. This is especiaUy important because 
dominant epitopes can vary in their affinity and because it might be important to be able 
to deliver mixmres of dominant and subdominant epitopes that are characterized by high 
and low MHC binding affinities. 

5 With respect to minigene construct optimization for maximum 

immunogenicity in vivo, conflicting data exists regarding whether the exact position of 
the epitopes in a given construct or the presence of flanking regions, helper T cell 
epitopes, and signal sequences might be crucial for CTL induction (Del Val et ai. Cell 
66:1 145 (1991); Bergmann et ai. J. Virol. 68:5306 (1994); Thomson et al. Proc. Natl. 

10 Acad. Sci. USA 92:5845 (1995); Shirai et al., J. Infect. Dis. 173:24 (1996); Rahemtulla et 
al.. Nature 353:180 (1991); Jennings et al.. Cell. Immunol. 133:234 (1991); Anderson et 
al.. J. Exp. Med. 174:489 (1991); Uger et al. J. Immunol. 158:685 (1997)). Finally, 
regarding development of assays that allow testing of human vaccine candidates, it should 
be noted that, to date, all in vivo immunogenicity data of multi-epitope minigene plasmids 

1 5 have been performed with murine class I MHC-restricted epitopes. It would be 

advantageous to be able to test the in vivo immunogenicity of minigenes containing 
human CTL epitopes in a convenient animal model system. 

Thus, there exists a need to develop methods to effectively deliver a 
variety of HTL (helper T lymphocyte) and CTL (cytotoxic T lymphocyte) antigens to 

20 stimulate an immune response. The present invention satisfies this need and provides 
related advantages as well. 

SUMMARY OF THE INVENTION 
The invention therefore provides expression vectors encoding two or more 

25 HTL epitopes fiised to a MHC class H targeting sequence, as well as expression vectors 
encoding a CTL epitope and a universal HTL epitope fused to an MHC class I targeting 
sequence. The HTL epitope can be a universal HTL epitope (also referred to as a 
universal MHC class H epitope). The invention also provides expression vectors 
encoding two or more HTL epitopes fiised to a MHC class U targeting sequence and 

30 encoding one or more CTL epitopes. The invention additionally provides methods of 

stimulating an immune response by administering an expression vector of the invention in 
vivo, as well as methods of assaying the human immunogenicity of a human T cell 
peptide epitope in vivo in a non-human mammal. 

- 6 - 
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In one aspect, the present invention provides an expression vector 
comprising a promoter operably linked to a first nucleotide sequence encoding a major 
histocompatibility (MHC) targeting sequence fused to a second nucleotide sequence 
encoding two or more heterologous peptide epitopes, wherein the heterologous peptide 
5 epitopes comprise two HTL peptide epitopes or a CTL peptide epitope and a universal 
HTL peptide epitope. 

In another aspect, the present invention provides a method of inducing an 
immune response in vivo comprising administering to a mammalian subject an expression 
vector comprising a promoter operably linked to a first nucleotide sequence encoding a 
10 major histocompatibility (MHC) targeting sequence fused to a second nucleotide 

sequence encoding two or more heterologous peptide epitopes, wherem the heterologous 
peptide epitopes comprise two HTL peptide epitopes or a CTL peptide epitope and a 
universal HTL peptide epitope. 

In another aspect, the present invention provides a method of inducing an 
15 immune response in vivo comprising administering to a mammalian subject an expression 
vector comprising a promoter operably linked to a first nucleotide sequence encoding a 
major histocompatibility (MHC) targeting sequence fused to a second nucleotide 
sequence encoding a heterologous human HTL peptide epitope. 

In another aspect, the present invention provides a method of assaying the 
20 human immunogenicity of a human T cell peptide epitope in vivo in a non-human 

mammal, comprising the step of administering to the non-human mammal an expression 
vector comprising a promoter operably linked to a furst nucleotide sequence encoding a 
heterologous human CTL or HTL peptide epitope. 

In one embodiment, the heterologous peptide epitopes comprise two or 
25 more heterologous HTL peptide epitopes. In another embodiment, the heterologous 
peptide epitopes comprise a CTL peptide epitope and a universal HTL peptide epitope. 
In another embodiment, the heterologous peptide epitopes further comprise one to two or 
more heterologous CTL peptide epitopes. In another embodiment, the expression vector 
comprises both HTL and CTL peptide epitopes. 
30 In one embodiment, one of the HTL peptide epitopes is a universal HTL 

epitope. In another embodiment, the universal HTL epitope is a pan DR epitope. In 
another embodiment, the pan DR epitope has the sequence 
AlaLysPheValAlaAlaTrpThrLeuLysAlaAlaAla (SEQ ID NO:38). 
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In one embodiment, the peptide epitopes are hepatitis B virus epitopes, 
hepatitis C virus epitopes, human immunodeficiency virus epitopes, human papilloma 
virus epitopes, MAGE epitopes, PSA epitopes, PSM epitopes, PAP epitopes, p53 
epitopes, CEA epitopes, Her2/neu epitopes, or Plasmodium epitopes. In another 
5 embodiment, the peptide epitopes each have a sequence selected from the group 

consisting of the peptides depicted in Tables 1-8. In another embodiment, at least one of 
the peptide epitopes is an analog of a peptide depicted in Tables 1-8. 

In one embodiment, the MHC targeting sequence comprises a region of a 
polypeptide selected from the group consisting of the li protein, LAMP -I, HLS-DM, 
1 0 HLA-DO, H2-D0, influenza matrix protein, hepatitis B surface antigen, hepatitis B virus 
core antigen, Ty panicle, Ig-ct protein, Ig-p protein, and Ig kappa chain signal sequence. 

In one embodiment, the expression vector further comprises a second 
promoter sequence operably linked to a third nucleotide sequence encoding one or more 
heterologous HTL or CTL peptide epitopes. In another embodiment, the CTL peptide 
1 5 epitope comprises a sffuctural motif for an HL A supertype, whereby the peptide CTL 
epitope binds to two or more members of the supertype with an affmity of greater that 
500 nM. In another embodiment, the CTL peptide epitopes have structural motifs that 
provide binding affinity for more than one HLA allele supertype. 

In one embodiment, the non-human mammal is a transgenic mouse that 
20 expresses a human HLA allele. In another embodiment, the human HLA allele is selected 
from the group consisting of Al 1 and A2.1. In another embodiment, the non-human 
mammal is a macaque that expresses a human HLA allele. 

BRIEF DESCRIPTION OF THE DRAWINGS 
25 Figure 1 shows the nucleotide and amino acid sequences (SEQ ID N0S:1 

and 2, respectively) of the liPADRE construct encoding a fusion of the murine li gene 
with a pan DR epitope sequence substituted for the CLIP sequence of the li protein. 

Figure 2 shows the nucleotide and amino acid sequences (SEQ ID N0S:3 
and 4, respectively) of the I80T construct encoding a fusion of the cytoplasmic domain, 
30 the transmembrane domain and part of the luminal domain of the li protein fused to 
multiple MHC class n epitopes. 

Figure 3 shows the nucleotide and amino acid sequences (SEQ ED N0S:5 
and 6, respectively) of the liThfull construct encoding a fusion of the cytoplasmic 
domain, transmembrane domain and a portion of the luminal domain of the li protein 
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fused to multiple T helper epitopes and amino acid residues 101 to 215 of the li protein, 
which encodes the trimerization region of the li protein. 

Figure 4 shows the nucleotide and amino acid sequences (SEQ ID N0S:7 
and 8, respectively) of the KappaLAMP-Th construct encoding a fiision of the murine 
5 immunoglobulin kappa signal sequence fused to multiple T helper epitopes and the 
transmembrane and cytoplasmic domains of LAMP-1 . 

Figure 5 shows the nucleotide and amino acid sequences (SEQ ID N0S:9 
and 10, respectively) of the H2M-Th construct encoding a fusion of the signal sequence 
of H2-M fused to multiple MHC class H epitopes and the transmembrane and 
10 cytoplasmic domains of H2-M. 

Figure 6 shows the nucleotide and amino acid sequences (SEQ ID N0S:1 1 
and 12, respectively) of the H20-Th construct encoding a fusion of die signal sequence of 
H2-D0 fused to multiple MHC class n epitopes and the transmembrane and cytoplasmic 
domains of H2-D0. 

1 5 Figure 7 shows the nucleotide and amino acid sequences (SEQ ID NOS : 1 3 

and 14, respectively) of the PADRE-Influenza matrix construct encoding a fusion of a 
pan DR epitope sequence fused to the amino-terminus of influenza matiix protein. 

Figure 8 shows the nucleotide and amino acid sequences (SEQ ID NOS: 15 
and 16, respectively) of the PADRE-HBV-s constinict encoding a fusion of a pan DR 
20 epitope sequence fused to the amino-terminus of hepatitis B virus surface antigen. 

Figure 9 shows the nucleotide and amino acid sequences (SEQ ED NOS: 17 
and 18, respectively) of the Ig-alphaTh constiiict encoding a fusion of the signal sequence 
of the Ig-a protein fused to multiple MHC class II epitopes and the transmembrane and 
cytoplasmic domains of the Ig-a protein. 
25 Figiu-e 10 shows the nucleotide and amino acid sequences (SEQ ID 

NOS: 19 and 20, respectively) of the Ig-betaTh constinict encoding a fusion of the signal 
sequence of the Ig-p protein fused to multiple MHC class II epitopes and the 
transmembrane and cytoplasmic domains of the Ig-P protein. 

Figure 1 1 shows the nucleotide and amino acid sequences (SEQ ID 
30 N0S:21 and 22, respectively) of the SigTh construct encoding a fusion of the signal 
sequence of the kappa immunoglobulin fiised to multiple MHC class H epitopes. 

Figure 12 shows the nucleotide and amino acid sequences (SEQ ID 
NOS:23 and 24, respectively) of human HLA-DR, the invariant chain (li) protein. 
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Figure 13 shows the nucleotide and amino acid sequences (SEQ ID 
NOS:25 and 26, respectively) of human lysosomal membrane glycoprotein- 1 (LAMP-1). 

Figure 14 shows the nucleotide and amino acid sequences (SEQ ID 
NOS:27 and 28, respectively) of human HLA-DMB. 
5 Figure 1 5 shows the nucleotide and amino acid sequences (SEQ ID 

NOS:29 and 30. respectively) of human HLA-DO beta. 

Figure 16 shows the nucleotide and amino acid sequences (SEQ ID 
N0S:3 1 and 32, respectively) of the human MB-1 Ig-a. 

Figure 17 shows the nucleotide and amino acid sequences (SEQ ID 
10 NOS:33 and 34, respectively) of human Ig-|3 protein. 

Figure 1 8 shows a schematic diactam denictinp the method of generating 
some of the constructs encoding a MHC class 11 targeting sequence fused to multiple 
MHC class II epitopes. 

Figure 19 shows the nucleotide sequence of the vector pEP2 (SEQ ID 

15 NO:35). 

Figure 20 shows the nucleotide sequence of the vector pMIN.O (SEQ ID 

NO:36). 

Figure 21 shows the nucleotide sequence of the vector pMIN.l (SEQ ID 

NO:37). 

20 Figure 22. Representative CTL responses in HLA-A2. 1/K''-H-2'"" mice 

immunized with pMin.l DNA. Splenocytes from primed animals were cultured in 
triplicate flasks and stimulated twice in vitro with each peptide epitope. Cytotoxicity of 
each culture was assayed in a ^'Cr release assay against Jurkat-A2.1/K'' target cells in the 
presence (filled symbols, solid lines) or absence (open symbols, dotted lines) of peptide. 

25 Each symbol represents the response of a single culture. 

Figure 23. Presentation of viral epitopes to specific CTLs by Jurkat- 
A2.1/K'' tumor cells transfected with DNA minigene. Two constructs were used for 
transfection,.pMin.l and pMin.2-GFP. pMin.2-GFP-transfected targets cells were sorted 
by FACS and the population used in this experiment contained 60% fluorescent cells. 

30 CTL stimulation was measured by quantitating the amoimt of IFN-y release (A, B) or by 
lysis of ^'Cr-labeled target cells (C, D, hatched bars). CTLs were stimulated with 
transfected cells (A, C) or with parental Jurlcat-A2. IfK^ cells in the presence of 1 tig/ml 
peptide (B, D). Levels of IFN- y release and cytotoxicity for the different CTL lines in 
the absence of epitope ranged from 72-126 pg/ml and 2-6% respectively. 
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Figure 24. Summary of modified minigene constructs used to address 
variables critical for in vivo immunogenicity. The following modifications were 
incorporated into the prototype pMin.l construct; A, deletion of PADRE HTL epitope; B, 
incorporation of the native HBV Pol 551 epitope that contains an alanine in position 9; C, 
deletion of the Ig kappa signal sequence; and D, switching position of the HBV Env 335 
and HBV Pol 455 epitopes. 

Figure 25. Examination of variables that may influence pMin.l 
immunogenicity. In vivo CTL-inducing activity of pMin.l is compared to modified 
constructs. For ease of comparison, the CTL response induced by each of the modified 
DNA minigene constructs (shaded bars) is compared separately in each of the four panels 
to the response induced by the prototype pMiri.i construct (solid bars). The geometnc 
mean response of CTL-positive cultures firom two to five independent experiments are 
shown. Numbers shown with each bar indicate the number of positive cultures/total 
number tested for that particular epitope. The ratio of positive cultures/total tested for the 
pMin.l group is shown in panel A and is the same for the remaining Figure panels (see 
Example V, Materials and Methods, in vitro CTL cultures, for the definition of a positive 
CTL culture). Theradigm responses were obtained by immunizing. animals with the 
lipopeptide and stimulating and testing splenocyte cultures with the HBV Core 18-27 
peptide. 

DEFINITIONS 

An "HTL" peptide epitopeor an "MHC II epitope" is an MHC class H 
restricted epitope, i.e., one that is bound by an MHC class II molecule. 

A "CTL" peptide epitope or an "MHC I epitope" is an MHC class I 
restricted epitope, i.e., one that is bound by an MHC class I molecule. 

An "MHC targeting sequence" refers to a peptide sequence that targets a 
polypeptide, e.g., comprising a peptide epitope, to a cytosolic pathway (e.g., an MHC 
class I antigen processing pathway), en endoplasmic reticulum pathwasy, or an endocytic 
pathway (e.g., an MHC class II antigen processing pathway). 

The term "heterologous" when used with reference to pori;ions of a nucleic 
acid indicates that the nucleic acid comprises two or more subsequences that are not 
found in the same relationship to each other in nature. For instance, the nucleic acid is 
typically recombinantly produced, having two or more sequences firom unrelated genes 
arranged to make a new fimctional nucleic acid, e.g., a promoter firom one source and a 
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coding region from another source. Similarly, a heterologous protein indicates that the 
protein comprises t%vo or more subsequences that are not found in the same relationship to 
each other in nature, e.g., a fusion polypeptide comprising subsequence firom different 
polypeptides, peptide epitopes from the same polypeptide that are not naturally in an 
adjacent position, or repeats of a single peptide epitope. 

As used herein, the term "universal MHC class II epitope" or a "imiversal 
HTL epitope" refers to a MHC class II peptide epitope that binds to gene products of 
multiple MHC class II alleles. For example, the DR, DP and DQ alleles are human MHC 
n alleles. Generally, a unique set of peptides binds to a particular gene product of a MHC 
class II allele. In contrast, a universal MHC class II epitope is able to bind to gene 
products of muliiple MHC class II alleles A universal MHC class 11 epitope binds to 2 or 
more MHC class II alleles, generally 3 or more MHC class 11 alleles, and particularly 5 or 
more MHC class 11 alleles. Thus, the presence of a universal MHC class n epitope in an 
expression vector is advantageous in that it functions to increase the number of allelic 
MHC class II molecules that can bind to the peptide and, consequently, the number of 
Helper T lymphocytes that are activated. 

Universal MHC class II epitopes are well known in the art and include, for 
example, epitopes such as the "pan DR epitopes," also referred to as "PADRE" 
(Alexander et ai. Immunity 1:751-761 (1994); WO 95/07707, USSN 60/036,713, USSN 
60/037,432, PCT/'US98/01373, 09/009,953, and USSN 60/087,192 each of which is 
incorporated herein by reference). A "pan DR binding peptide" or a "PADRE" peptide of 
the invention is a peptide capable of binding at least about 7 different DR molecules, 
preferably 7 of the 12 most common DR molecules, most preferably 9 of die 12 most 
common DR molecules (DRl, 2w2b, 2w2a, 3, 4w4, 4wl4, 5, 7, 52a, 52b, 52c, and 53), or 
altematively, 50% of a panel of DR molecules representative of greater than or equal to 
75% of the human population, preferably greater than or equal to 80% of the human 
population. Pan DR epitopes can bind to a number of DR alleles and are strongly 
immunogenic for T cells. For example, pan DR epitopes were found to be more effective 
at inducing an immune response than natural MHC class n epitopes (Alexander, supra). 
An example of a PADRE epitope is the peptide 

AlaLysPheValAlaAlaTrpThrLeuLysAlaAlaAla (SEQ ID NO:38) (for additional 
examples of PADRE epitopes, see Table 8 of TTC docket No. 018623-006221, filed May 
12, 1999, USSN , herein incorporated by reference in its entirety). 
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With regard to a particular amino acid sequence, an "epitope" is a set of 
amino acid residues which is involved in recognition by a particular immunoglobulin, or 
in the context of T cells, those residues necessary for recognition by T cell receptor 
proteins and/or Major Histocompatibility Complex (MHC) receptors. In an immune 
5 system setting, in vivo or in vitro, an epitope is the collective features of a molecule, such 
as primary, secondary and tertiary peptide structure, and charge, that together form a site 
recognized by an immunoglobulin, T cell receptor or HLA molecule. Throughout this 
disclosure epitope and peptide are often used interchangeably. It is to be appreciated, 
however, that isolated or purified protein or peptide molecules larger than and comprising 
1 0 an epitope of the invention are still within the bounds of the invention. 

As used herein, "high affinity'' with respect to HLA class I molecules is 
defined as binding with an IC50 (or Kd) of less than 50 nM. "Intermediate affinit>'" is 
binding with an IC50 (or Kd) of between about 50 and about 500 nM. "High affinity" 
with respect to binding to HLA class II molecules is defined as binding with an Kd of 
15 less than 100 nM. "Intermediate affinity" is binding with a Kd of between about 100 and 
about 1000 nM. Assays for determining binding are described in detail, e.g., in PCT 
publications WO 94/20127 and WO 94/03205. Altematively, binding is expressed 
relative to a reference peptide. As a particular assay becomes more, or less, sensitive, the 
IC50s of the peptides tested may change somewhat. However, the binding relative to the 
20 reference peptide will not significantly change. For example, in an assay run under 

conditions such that the IC50 of the reference peptide increases 10-fold, the IC50 values 
of the test peptides will also shift approximately 10-fold. Therefore, to avoid ambiguities, 
the assessment of whether a peptide is a good, intermediate, weak, or negative binder is 
generally based on its 1C50, relative to the IC50 of a standard peptide. 
25 Throughout this disclosure, results are expressed in terms of "IC50s." 

IC50 is the concentration of peptide in a binding assay at which 50% inhibition of binding 
of a reference peptide is observed. Given the conditions in which the assays are mn (i.e., 
limiting HLA proteins and labeled peptide concentrations), these values approximate KD 
values. It should be noted that IC50 values can change, often dramatically, if the assay 
30 conditions are varied, and depending on the particular reagents used (e.g., HLA 

preparation, etc.). For example, excessive concentrations of HLA molecules will increase 
the apparent measured IC50 of a given ligand. 

The terras "identical" or percent "identity," in the context of two or more 
peptide sequences, refer to two or more sequences or subsequences that are the same or 
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have a specified percentage of amino acid residues that are the same, when compared and 
aligned for maximum correspondence over a comparison window, as measwed using a 
sequence comparison algorithms using default program parameters or by manual 
alignment and visual inspection. 
5 The phrases "isolated" or "biologically pure" refer to material which is 

substantially or essentially free from components which normally accompany the material 
as it is found in its native state. Thus, isolated peptides in accordance with the invention 
preferably do not contain materials normally associated with the peptides in their in situ 
environment. 

10 "Major histocompatibiUty complex" or "MHC" is a cluster of genes that 

plays a role in control of the cellular interactions responsible for physiologic immune 
responses. In humans, the MHC complex is also known as the HLA complex. For a 
detailed description of the MHC and HLA complexes, see Paul, Fundamental 
Immunology (3Td ed. 1993). 

15 "Human leukocyte antigen" or "HLA" is a human class I or class II major 

histocompatibility complex (MHC) protein {see, e.g., Stites, et al.. Immunology, (8th ed., 
1994). 

An "HLA supertype or family", as used herein, describes sets of HLA 
molecules grouped on the basis of shared peptide-binding specificities. HLA class I 

20 molecules that share somewhat similar binding affinity for peptides bearing certain amino 
acid motifs are grouped into HLA supertypes. The terms HLA superfamily, HLA 
supertype family, HLA family, and HLA xx-like supertype molecules (where xx denotes 
a particular HLA type), are synonjmis. 

The term "motif refers to the pattem of residues in a peptide of defined 

25 length, usually a peptide of from about 8 to about 13 amino acids for a class I HLA motif 
and from about 6 to about 25 amino acids for a class 11 HLA motif, which is recognized 
by a particular HLA molecule. Peptide motifs are typically different for each protein 
encoded by each human HLA allele and differ in the pattem of the primary and secondary 
anchor residues. 

30 A "supermotif ' is a peptide binding specificity shared by HLA molecules 

encoded by two or more HLA alleles. Thus, a preferably is recognized with high or 
intermediate affinit\' (as defined herein) by two or more HLA antigens. 

"Cross-reactive binding" indicates that a peptide is bound by more than 
one HLA molecule; a s3aionym is degenerate binding. 
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The terni "peptide" is used interchangeably with "oligopeptide" in the 
present specification to designate a series of residues, typically L-amino acids, connected 
one to the other, typically by peptide bonds between the a-amino and carboxyl groups of 
adjacent amino adds. The preferred CTL-inducing oligopet)tides of the invention are 13 
5 residues or less in length and usually consist of between about 8 and about 1 1 residues, 
preferably 9 or 10 residues. The preferred HTL-inducing oligopeptides are less than 
about 50 residues in length and usually consist of between about 6 and about 30 residues, 
more usually between about 12 and 25, and often between about 15 and 20 residues. 

An "immunogenic peptide" or "peptide epitope" is a peptide which 
1 0 comprises an allele-specific motif or supermotif such that the peptide will bind an HLA 
molecule and induce a CTL and/or HTL response. Thus, immuriogemc peptides of the 
invention are capable of binding to an appropriate HLA molecule and thereafter inducing 
a cytotoxic T cell response, or a helper T cell response, to the antigen from which the 
immunogenic peptide is derived. 
15 A "protective immune response" refers to a CTL and/or an HTL response 

to an antigen derived from an infectious agent or a tumor antigen, which prevents or at 
least partially arrests disease symptoms or progression. The immune response may also 
include an antibody response which has been facilitated by the stimulation of helper T 
cells. 

20 The term "residue" refers to an amino acid or amino acid mimetic 

incorporated into an oligopeptide by an amide bond or amide bond mimetic. 

"S\-nthetic peptide" refers to a peptide that is not naturally occurring, but is 
man-made using such methods as chemical synthesis or recombinant DNA technology. 
The nomenclature used to describe peptide compounds follows the 
25 conventional practice wherein the amino group is presented to the left (the N-tenninus) 
and the carboxyl group to the right (the C-terminus) of each amino acid residue. When 
amino acid residue positions are referred to in a peptide epitope they are numbered in an 
amino to carboxyl direction with position one being the position closest to the amino 
terminal end of the epitope, or the peptide or protein of which it may be a part. In the 
30 formulae representing selected specific embodiments of the present invention, the amino- 
and caiboxyl-terminal groups, although not specifically shown, are in the form they 
would assume at physiologic pH values, unless otherwise specified. In the amino acid 
stmcmre formulae, each residue is generally represented by standard three letter or single 
letter designations. The L-form of an amino acid residue is represented by a capital single 
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letter or a capital first letter of a three-letter symbol, and the D-form for those amino acids 
having D-forms is represented by a lower case single letter or a lower case three letter 
symbol. Glycine has no asymmetric carbon atom and is simply referred to as "Gly" or G. 

As used herein, the term "expression vector" is intended to refer to a 
nucleic acid molecule capable of expressing an antigen of interest such as a MHC class I 
or class n epitope in an appropriate target cell. An expression vector can be, for example, 
a plasmid or virus, including DNA or RNA viruses. The expression vector contains such 
a promoter element to express an antigen of interest in the appropriate cell or tissue in 
order to stimulate a desired immune response. 

DETAILED DESCRIPTION OF THE IinVcNTION 
Cytotoxic T lymphocytes (CTLs) and helper T lymphocytes (HTLs) are 
critical for immunir>- against infectious pathogens; such as viruses, bacteria, and protozoa; 
tumor cells; autoimraunne diseases and the like. The present invention provides 
minigenes that encode peptide epitopes which induce a CTL and/or HTL response. The 
minigenes of the im ention also include an MHC targeting sequence. A variety of 
minigenes encoding different epitopes can be tested for immunogenicity using an HLA 
transgenic mouse. The epitopes are typically a combination of at least two or more HTL 
epitopes, or a CTL epitope plus a universal HTL epitope, and optinally include additional 
HTl and/or CTL epitopes. Two, three, four, five, six, seven, eight, nine, ten, twenty, 
thirty, forty or about fifty different epitopes, either HTL and/or CTL, can be included in 
the minigene, along with the MHC targeting sequence. The epitopes can have different 
HLA restriction. Epitopes to be tested include those derived from viruses such as HIV, 
HBV, HCV, HSV, CMV, HPV, and HTLV; cancer antigens such as p53, Her2/Neu, 
MAGE, PSA, human papilloma virus, and CEA; parasites such as Trypanosoma. 
Plasmodium, Leishmania. Giardia, Entamoeba; autoimmune diseases such as rheumatoid 
arthritis, myesthenia gravis, and lupus erythematosus; fungi such as Aspergillus and 
Candida; and bacteria such as Escherichia coli, Staphylococci. Chlamydia. Mycobacteria. 
Streptococci, and Pseudomonas. The epitopes to be encoded by the minigene are selected 
and tested using the methods described in published PCT applications WO 93/07421, WO 
94/02353, WO 95/01000, WO 97/04451, and WO 97/05348, herein incorporated by 
reference. 
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HTL and CTL Epitopes 

The expression vectors of the invention encode one or more MHC class 11 
and/or class I epitopes and an MHC targeting sequence. Multiple MHC class 11 or class I 
epitopes present in an expression vector can be derived from the same antigen, or the 
5 MHC epitopes can be derived from different antigens. For example, an expression vector 
can contain one or more MHC epitopes that can be derived from two different antigens of 
the same vims or from two different antigens of different viruses. Furthermore, any 
MHC epitope can be used in the expression vectors of the invention. For example, any 
single MHC epitope or a combination of the MHC epitopes shown in Tables 1 to 8 can be 

10 used in the expression vectors of the invention. Other peptide epitopes can be selected by 
one of skill in the an, e.g., by using a computer to select epitopes that contain HLA allele- 
specific motifs or supermotifs. The expression vectors of the invention can also encode 
one or more universal MHC class II epitopes, e.g., PADRE (see. e.g., SEQ ID NO:38 and 
Table 8 of TTC docket No. 018623-006221, filed May 12, 1999, USSN 

15 ). 

Universal MHC class II epitopes can be advantageously combined with 
other MHC class I and class II epitopes to increase the number of cells that are activated 
in response to a given antigen and provide broader population coverage of MHC-reactive 
alleles. Thus, the expression vectors of the invention can encode MHC epitopes specific 

20 for an antigen, universal MHC class II epitopes, or a combination of specific MHC 
epitopes and at least one universal MHC class II epitope. 

MHC class I epitopes are generally about 5 to 15 amino acids in length, in 
particular about 8 to 11 amino acids in length. MHC class n epitopes are generally about 
10 to 25 amino acids in length, in particular about 13 to 21 amino acids in length. A 

25 MHC class I or II epitope can be derived from any desired antigen of interest. The 
antigen of interest can be a viral antigen, surface receptor, tumor antigen, oncogene, 
enzyme, or any pathogen, cell or molecule for which an immune response is desired. 
Epitopes can be selected based on their ability to bind one or multiple HLA alleles, and 
can also be selected using the "analog" technique described below. 

30 

Targeting Sequences 

The expression vectors of the invention encode one or more IVffiC epitopes 
operably linked to a MHC targeting sequence. The use of a MHC targeting sequence 
enhances the immune response to an antigen, relative to delivery of antigen alone, by 
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directing the peptide epitope to the site of MHC molecule assembly and transport to the 
cell surface, thereby providing an increased number of MHC molecule-peptide epitope 
complexes available for binding to and activation of T cells. 

MHC class I targeting sequences are used in the present invention, e.g., 
5 those sequences that target an MHC class I epitope peptide to a cytosolic pathway or to 
the endoplasmic reticulum {see, e.g., Rammensee et al.. Immunogenetics 41 : 178-228 
(1995)). For example, the cytosolic pathway processes endogenous antigens that are 
expressed inside the cell. Although not wishing to be bound by any particular theory, 
cytosolic proteins are thought to be at least partially degraded by an endopeptidase 

1 0 activity of a proteasome and then transported to the endoplasmic reticulum by the TAP 
molecule (transporter associated with processing). In the endoplasmic reticulum, the 
antigen binds to MHC class I molecules. Endoplasmic reticulum signal sequences bypass 
the cytosolic processing pathway and directly target endogenous antigens to the 
endoplasmic reticulum, where proteolytic degradation into peptide fragments occurs. 

15 Such MHC class I targeting sequences are well known in the art, and include, e.g., signal 
sequences such as those from Ig kappa .tissue plasminogen activator or insulin. A 
preferred signal peptide is the human Ig kappa chain sequence. Endoplasmic reticulum 
signal sequences can also be used to target MHC class 11 epitopes to the endoplasmic 
reticulum, the site of MHC class I molecule assembly. 

20 MHC class II targeting sequences are also used in the invention, e.g., those 

that target a peptide to the endocytic pathway. These targeting sequences typically direct 
extracellular antigens to enter the endocytic pathway, which results in the antigen being 
transferred to the lysosomal compartment where the antigen is proteolytically cleaved 
into antigen peptides for binding to MHC class 11 molecules. As with the normal 

25 processing of exogenous antigen, a sequence that directs a MHC class 11 epitope to the 
endosomes of the endocytic pathway and/or subsequently to lysosomes, where the MHC 
class n epitope can bind to a MHC class U molecule, is a MHC class 11 targeting 
sequence. For example, group of MHC class 11 targeting sequences useful in the 
invention are lysosomal targeting sequences, which localize polypeptides to lysosomes. 

30 Since MHC class n molecules typically bind to antigen peptides derived from proteolytic 
processing of endocytosed antigens in lysosomes, a lysosomal targeting sequence can 
flmction as a MHC class n targeting sequence. Lysosomal targeting sequences are well 
known in the art and include sequences found in the lysosomal proteins LAMP-1 and 
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LAMP-2 as described by August et al. (U.S. Patent No. 5,633,234, issued May 27, 1997), 
which is incorporated herein by reference. 

Other lysosomal proteins that contain lysosomal targeting sequences 
include HLA-DM. HLA-DM is an endosomal/lysosomal protein that functions in 
5 facilitating binding of antigen peptides to MHC class 11 molecules. Since it is located in 
the lysosome, HLA-DM has a lysosomal targeting sequence that can function as a MHC 
class II molecule targeting sequence (Copier a/.. J. Immunol. 157:1017-1027 (1996), 
which is incorporated herein by reference). 

The resident lysosomal protein HLA-DO can also function as a lysosomal 

10 targeting sequence. In contrast to the above described resident lysosomal proteins 

LAivIF-i and HLA-Dm, wiiich encode specific Tyi -containing motifs that target proteins 
to lysosomes, HLA-DO is targeted to lysosomes by association with HLA-DM (Liijedahl 
et al, EMBOJ. 15:4817-4824 (1996)), which is incorporated herein by reference. 
Therefore, the sequences of HLA-DO that cause association with HLA-DM and, 

1 5 consequently, translocation of HLA-DO to lysosomes can be used as MHC class n 

targeting sequences. Similarly, the murine homolog of HLA-DO, H2-D0, can be used to 
derive a MHC class 11 targeting sequence. A MHC class II epitope can be fused to HLA- 
DO or H2-D0 and targeted to lysosomes. 

In another example, the cytoplasmic domains of B cell receptor subunits 

20 Ig-a and Ig-p mediate antigen intemalization and increase the efficiency of antigen 

presentation (Bonnerot et al, Immunity 3:335-341 (1995)), which is incorporated herein 
by reference. Therefore, the cytoplasmic domains of the Ig-a and Ig-P proteins can 
function as MHC class II targeting sequences that target a MHC class II epitope to the 
endocytic pathway for processing and binding to MHC class II molecules. 

25 Another example of a MHC class n targetmg sequence that directs MHC 

class n epitopes to the endocytic pathway is a sequence that directs polypeptides to be 
secreted, where the polypeptide can enter the endosomal pathway. These MHC class 11 
targeting sequences that direct polypeptides to be secreted mimic the normal pathway by 
which exogenous, extracellular antigens are processed into peptides that bind to MHC 

30 class II molecules. Any signal sequence that functions to direct a polypeptide through the 
endoplasmic reticulum and ultimately to be secreted can function as a MHC class 11 
targeting sequence so long as the secreted polypeptide can enter the endosomal/lysosomal 
pathway and be cleaved into peptides that can bind to MHC class n molecules. An 
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example of such a fusion is shown in Fig\ire 11, where the signal sequence of kappa 
immunoglobulin is fused to multiple MHC class II epitopes. 

In another example, the li protein binds to MHC class n molecules in the 
endoplasmic reticulum, where it functions to prevent peptides present in the endoplasmic 
5 reticulum from binding to the MHC class 11 molecules. Therefore, fusion of a MHC class 
II epitope to the li protein targets the MHC class 11 epitope to the endoplasmic reticulum 
and a MHC class II molecule. For example, the CLIP sequence of the li protein can be 
removed and replaced with a MHC class 11 epitope sequence so that the MHC class 11 
epitope is directed to the endoplasmic reticulum, where the epitope binds to a MHC class 
10 II molecule. 

In some cases, aniigens tiiemsclves can serve as JviIIC class II or I 
targeting sequences and can be fused to a imiversal MHC class II epitope to stimulate an 
immune response, .\lthough cytoplasmic viral antigens are generally processed and 
presented as complexes with MHC class I molecules, long-lived cytoplasmic proteins 

1 5 such as the influenza matrix protein can enter the MHC class 11 molecule processing 

pathway (Gueguen & Long, Proc. Natl. Acad. Sci. USA 93:14692-14697 (1996)), which 
is incorporated herein by reference. Therefore, long-lived cytoplasmic proteins can 
function as a MHC class II targeting sequence. For example, an expression vector 
encoding influenza matrix protein fused to a universal MHC class II epitope can be 

20 advantageously used to target influenza antigen and the universal MHC class 11 epitope to 
the MHC class II pathway for stimulating an rmmime response to influenza. 

Other examples of antigens functioning as MHC class n targeting 
sequences include polypeptides that spontaneously form particles. The polypeptides are 
secreted from the cell that produces them and spontaneously form particles, which are 

25 taken up into an antigen-presenting cell by endocytosis such as receptor-mediated 

endocytosis or are engulfed by phagocytosis. The particles are proteolytically cleaved 
into antigen peptides after entering the endosoraal/lysosomal pathway. 

One such polypeptide that spontaneously forms particles is HBV surface 
antigen (HBV-S) (Diminsky et ai. Vaccine 15:637-647 (1997); Le Borgne et al. 

30 Virology 240:304-3 1 5 (1998)), each of which is incorporated herein by reference. 

Another polypeptide that spontaneously fonns particles is HBV core antigen (Kuhrober et 
al. International Immunol. 9:1203-1212 (1997)), which is incorporated herein by 
reference. Still another polypeptide that spontaneously forms particles is the yeast Ty 
protein (Weber et ai. Vaccine 13:831-834 (1995)), which is incorporated herein by 
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reference. For example, an expression vector containing HBV-S antigen fused to a 
universal MHC class H epitope can be advantageously used to target HBV-S antigen and 
the universal MHC class H epitope to the MHC class H pathway for stimulating an 
immune response to HBV. 

Binding Affinity of Peptide Epitopes for HLA Molecules 

The large degree of HLA polymorphism is an important factor to be taken 
into account with the epitope-based approach to vaccine development. To address this 
factor, epitope selection encompassing identification of peptides capable of binding at 
high or intermediate affinity to multiple HLA molecules is preferably utilized, most 
preferably these epitopes bind at liigh or mtermediate affinit>' to two or more allele 
specific HLA molecules. 

CTL-inducing peptides of interest for vaccine compositions preferably 
include those that have a binding affinity for class I HLA molecules of less than 500 nM. 
HTL-inducing peptides preferably include those that have a binding affinity for class H 
HLA molecules of less than 1000 nM. For example, peptide binding is assessed by 
testing the capacity of a candidate peptide to bind to a purified HLA molecule in vitro. 
Peptides exhibiting high or intermediate affinity are then considered for further analysis. 
Selected peptides are tested on other members of the supertype family. In preferred 
embodiments, peptides that exhibit cross-reactive binding are then used in vaccines or in 
cellular screening analyses. 

Higher HLA binding affinity is typically correlated with greater 
immunogenicity. Greater immunogenicity can be manifested in several different ways. 
Immunogenicity corresponds to whether an immune response is eUcited at all, and to the 
vigor of any particular response, as well as to the extent of a population in which a 
response is elicited. For example, a peptide might elicit an immune response in a diverse 
array of the population, yet in no instance produce a vigorous response. In accordance 
with these principles, close to 90% of high binding peptides have been found to be 
immunogenic, as contrasted with about 50% of the peptides which bind with intermediate 
I affinity. Moreover, higher binding affinity peptides leads to more vigorous immunogenic 
responses. As a result, less peptide is required to elicit a similar biological effect if a high 
affinity binding peptide is used. Thus, in preferred embodiments of the invention, high 
binding epitopes are particularly useful. 
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The relationship between binding afEnity for HLA class I molecules and 
immunogenicity of discrete peptide epitopes on bound antigens has been determined for 
the first time in the art by the present inventors. The correlation between binding affinity 
and immunogenicity was analyzed in two different experimental approaches (Sette et ai, 
5 J. Immunol. 153:5586-5592 (1994)). In the first approach, the immunogenicity of 

potential epitopes ranging in HLA binding affmity over a 10,000-fold range was analyzed 
in HLA-A*0201 transgenic mice. In the second approach, the antigenicity of 
approximately 100 different hepatitis B virus (HBV)-derived potential epitopes, all 
carrying A*0201 binding motifs, was assessed by using PBL (peripheral blood 

10 lymphocytes) firom acute hepatitis patients. Pursuant to these approaches, it was 

determined that an affinity thj-eshold of approximately 500 nM (preferably 50 nM or less) 
determines the capacity of a peptide epitope to elicit a CTL response. These data are true 
for class I binding affinity measurements for naturally processed peptides and for 
synthesized T cell epitopes. These data also indicate the important role of deteraiinant 

15 selection in the shaping of T cell responses {see, e.g., Schaeffer et al. Proc. Natl. Acad. 
Sci. USA 86:4649-4653, 1989). 

An affinity threshold associated with immunogenicity in the context of 
HLA class II DR molecules has also been delineated {see, e.g., Southwood et al. J. 
Immunology 160:3363-3373 (1998), and USSN 60/087192, filed 5/29/98). In order to 

20 defme a biologically significant threshold of DR binding affinity, a database of the 

binding affinities of 32 DR-restricted epitopes for their restricting element (i.e., the HLA 
molecule that binds the motif) was compiled. In approximately half of the cases (15 of 32 
epitopes), DR restriction was associated with high binding affinities, i.e. binding affinities 
of less than 100 nM. In the other half of the cases (16 of 32), DR restriction was 

25 associated with intermediate affinity (binding affinities in the 100-1000 nM range). In 
only one of 32 cases was DR restriction associated with an IC50 of 1000 nM or greater. 
Thus, 1000 nM can be defined as an affinity threshold associated vidth immunogenicity in 
the context of DR molecules. 

30 Peptide Epitope Binding Motifs and Supermotifs 

In the past few years evidence has accimiulated to demonstrate that a large 
firaction of HLA class I and class n molecules can be classified into a relatively few 
supertypes, each characterized by largely overlapping peptide binding repertoires, and 
consensus structures of the main peptide binding pockets. 
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For HLA molecule pocket analyses, the residues comprising the B and F 
pockets of HLA class I molecules as described in cryst'allographic studies were analyzed 
(Guo et al. Nature 360:364 (1992); Saper et al, J. Mol. Biol. 219:277 (1991); Madden et 
al. Cell 75:693 (1993); Parham et al. Immunol. Rev. 143:141 (1995)). In these analyses, 
residues 9, 45, 63, 66, 67, 70, and 99 were considered to make up the B pocket; and the B 
pocket was deemed to determine the specificity for the amino acid residue in the second 
position of peptide ligands. Similarly, residues 77, 80, 81, and 116 were considered to 
determine the specificity of the F pocket; the F pocket was deemed to determine the 
specificity for the C-terminal residue of a peptide Ugand bound by the HLA class I 
molecule. 

Through the study of single amino acid substituted antigen analogs and the 
sequencing of endogenously bound, naturally processed peptides, critical residues 
required for allele-specific binding to HLA molecules have been identified. The presence 
of these residues correlates with binding affinity for HLA molecules. The identification 
i of motifs and/or supermotifs that correlate with high and intermediate affinity binding is 
an important issue with respect to the identification of immunogenic peptide epitopes for 
the inclusion in a vaccine. Kast et al. (/ Immunol. 152:3904-3912 (1994)) have shown 
that motif-bearing peptides account for 90% of the epitopes that bind to allele-specific 
HLA class I molecules. In this study all possible peptides of 9 amino acids in length and 
0 overiapping by eight amino acids (240 peptides), which cover the entire sequence of the 
E6 and E7 proteins of human papillomavirus type 1 6, were evaluated for binding to five 
allele-specific HLA molecules that are expressed at high firequency among different 
ethnic groups. This unbiased set of peptides allowed an evaluation of the predictive value 
of HLA class I motifs. From the set of 240 peptides, 22 peptides were identified that 
,5 bound to an allele-specific HLA molecules with high or intermediate affinity. Of these 
22 peptides, 20, (i.e., 91%), were motif-bearing. Thus, this study demonstrates the value 
of motifs for the identification of peptide epitopes for inclusion in a vaccine: appUcation 
of motif-based identification techniques eliminates screening of 90% of the potential 
epitopes in a target antigen protein sequence. 
50 Peptides of the present invention may also include epitopes that bind to 

MHC class n DR molecules. There is a significant difference between class I and class H 
' HLA molecules. This difference corresponds to the fact that, although a stringent size 
restriction and motif position relative to the binding pocket exists for peptides that bind to 
class I molecules, a greater degree of heterogeneity in both size and binding firame 
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position of the motif, relative to the N and C tennini of the peptide, exists for class II 
peptide ligands. 

This increased heterogeneity of HLA class 11 peptide ligands is due to the 
structure of the binding groove of the HLA class II molecule which, unlike its class I 

5 counterpart, is open at both ends. Crystallographic analysis of HLA class U DRB*0101- 
peptide complexes showed that the residues occupying position 1 and position 6 of 
peptides complexed with DRB*0101 engage two complementary pockets on the 
DRBa*0101 molecules, with the PI position corresponding to the most crucial anchor 
residue and the deepest hydrophobic pocket {see, e.g., Madden, Ann. Rev. Immunol. 

10 13:587 (1995)). Other studies have also pointed to the P6 position as a crucial anchor 
residue for binding to various other DR molecules. 

Thus, peptides of the present invention are identified by any one of several 
HLA class I or II -specific amino acid motifs {see. e.g.. Tables I-III of USSN 09/226,775, 
and 09/239,043, herein incorporated by reference in their entirety). If the presence of the 

1 5 motif corresponds to the ability to bind several allele-specific HLA antigens it is referred 
to as a supermotif. The allele-specific HLA molecules that bind to peptides that possess a 
particular amino acid superaiotif are collectively referred to as an HLA "supertype." 

Immune Response-Stimulating Peptide Analogs 

20 In general, CTL and HTL responses are not directed against all possible 

epitopes. Rather, they are restricted to a few "immunodominant" determinants 
(Zinkemagel et ai. Adv. Immunol. 27:5159 (1979); Bennink et al., J. Exp. Med. 
168:1935-1939 (1988); Rawle et al, J. Immunol. 146:3977-3984 (1991)). It has been 
recognized that immunodominance (Benacerraf et al. Science 175:273-279 (1972)) could 

25 be explained by either the ability of a given epitope to selectively bind a particular HLA 
protein (determinant selection theory) (Vitiello etal. J. Immunol 131:1635 (1983)); 
Rosenthal et al. Nature 267: 156-158 (1 977)), or being selectively recognized by the 
existing TCR (T cell receptor) specificity (repertoire theory) (Klein, Immunology, The 
Science of Self on self Discrimination, pp. 270-310 (1982)). It has been demonstrated that 

30 additional factors, mostly linked to processing events, can also play a key role in 

dictating, beyond strict immunogenicity, which of the many potential determinants will 
be presented as immunodominant (Sercarz et al, Annu. Rev. Immunol. 11: 729-766 
(1993)). 
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The concept of dominance and subdominance is relevant to 
immimotherapy of both infectious diseases and cancer. For example, in the course of 
chronic viral disease, recruitment of subdominant epitopes can be important for 
successful clearance of the infection, especially if dominant CTL or HTL specificities 
have been inactivated by functional tolerance, suppression, mutation of viruses and other 
mechanisms (Franco et al.. Curr. Opin. Immunol. 7:524-531 (1995)). In the case of 
cancer and tumor antigens, CTLs recognizing at least some of the highest binding affinity 
peptides might be fimctionally inactivated. Lower binding affmity peptides are 
preferentially recognized at these times, and may therefore be preferred in therapeutic or 
prophylactic anti-cancer vaccines. 

In particular, it has been noted that a significant number of epitopes 
derived fi-om known non-viral tumor associated antigens (TAA) bind HLA class I with 
intermediate affinity (IC50 in the 50-500 nM range). For example, it has been found that 
8 of 15 known TAA peptides recognized by tumor infiltrating lymphocytes (TIL) or CTL 
bound in the 50-500 nM range. (These data are in contrast with estimates that 90% of 
known viral antigens were bound by HLA class I molecules with IC50 of 50 nM or less, 
while only approximately 10% bound in the 50-500 nM range (Sette etaL. J. Immunol., 
153:558-5592 (1994)). In the cancer setting this phenomenon is probably due to 
elimination, or fimctional inhibition of the CTL recognizing several of the highest binding 
peptides, presumably because of T cell tolerization events. 

Without intending to be bound by theory, it is believed that because T cells 
to dominant epitopes may have been clonally deleted, selecting subdominant epitopes 
may allow extant T cells to be recruited, which will then lead to a therapeutic or 
prophylactic response. However, the binding of HLA molecules to subdominant epitopes 
is often less vigorous than to dominant ones. Accordingly, there is a need to be able to 
modulate the binding affmity of particular immunogenic epitopes for one or more HLA 
molecules, and thereby to modulate the immune response elicited by the peptide, for 
example to prepare analog peptides which elicit a more vigorous response. This abihty 
would greatly enhance the usefiibess of peptide-based vaccines and therapeutic agents. 

Thus, although peptides with suitable cross-reactivity among all alleles of 
a superfamily are identified by the screening procedures described above, cross-reactivity 
is not always as complete as possible, and in certain cases procedures to further increase 
cross-reactivity of peptides can be usefiil; moreover, such procedures can also be used to 
modify other propenies of the peptides such as binding affinity or peptide stability. 
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Having established the general rules that govern cross-reactivity of peptides for HLA 
alleles within a given motif or supermotif, modification (i.e., analoging) of the structure 
of peptides of particular interest in order to achieve broader (or otherwise modified) HLA 
binding capacity can be performed. More specifically, peptides which exhibit the 
broadest cross-reactivity patterns, can be produced in accordance with the teachings 
herein. The present concepts related to analog generation are set forth in greater detail in 
co-pending USSN 09/226,775. 

In brief, the strategy employed utilizes the motifs or supermotifs which 
conrelate with binding to certain HLA class I and 11 molecules. The motifs or supermotifs 
are defined by having primary anchors, and in many cases secondary anchors (see Tables 
I-III of USSN 09/226,775). Ajialog peptides can be created by substituting amino acids 
residues at primary anchor, secondary anchor, or at primary and secondary anchor 
positions. Generally, analogs are made for peptides that already bear a motif or 
supermotif Preferred secondary anchor residues of supermotifs and motifs that have 
been defined for HLA class I and class n binding peptides are shown in Tables II and III, 
respectively, of USSN 09/226,775. 

For a number of the motifs or supermotifs in accordance with the 
invention, residues are defined which are deleterious to binding to allele-specific HLA 
molecules or members of HLA supertypes that bind to the respective motif or supermotif 
{see Tables 11 and m of USSN 09/226,775). Accordingly, removal of such residues that 
are detrimental to binding can be performed in accordance with the methods described 
therein. For example, in the case of the A3 supertype, when all peptides that have such 
deleterious residues are removed fi-om the population of analyzed peptides, the incidence 
of cross-reactivity increases fi-om 22% to 37% (I., Sidney et al., Hu. Immunol. 45:79 
(1996)). Thus, one strategy to improve the cross-reactivity of peptides within a given 
supenmotif is simply to delete one or more of the deleterious residues present within a 
peptide and substitute a small "neutral" residue such as Ala (that may not influence T cell 
recognition of the peptide). An enhanced likehhood of cross-reactivity is expected if, 
together with elimination of detrimental residues within a peptide, "preferred" residues 
associated with high affinity binding to an allele-specific HLA molecule or to multiple 
HLA molecules within a superfamily are inserted. 

To ensure that an analog peptide, when used as a vaccine, actually elicits a 
CTL response to the native epitope in vivo (or, in the case of class II epitopes, a failure to 
elicit helper T cells that cross-react with the wild type peptides), the analog peptide may 
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be used to inununize T cells in vitro fronTindividuals of the appropriate HLA allele. 
Thereafter, the immunized cells' capacity to induce lysis of wild type peptide sensitized 
target cells is evaluated. In both class I and class n systems it will be desirable to use as 
targets, cells that have been either infected or transfected with the appropriate genes to 
5 establish whether endogenously produced antigen is also recognized by the relevant T 
cells. 

Another embodiment of the invention is to create analogs of weak binding 
peptides, to thereby ensure adequate numbers of cross-reactive cellular binders. Class I 
peptides exhibiting binding affinities of 500-50000 nM, and carrying an acceptable but 

1 0 suboptimal primary anchor residue at one or both positions can be "fixed" by substituting 
preferred anchor residues in accordance with the respective supertype. The analog 
peptides can then be tested for crossbinding activity. 

Another embodiment for generating effective peptide analogs involves the 
substitution of residues that have an adverse impact on peptide stability or solubility in, 

15 e.g., a liquid environment. This substitution may occur at any position of the peptide 
epitope. For example, a cysteine (C) can be substituted out in favor of gamma-amino 
butyric acid. Due to its chemical nature, cysteine has the propensity to form disulfide 
bridges and sufficiently alter the peptide structurally so as to reduce binding capacity. 
Substituting gamma-amino butyric acid for C not only alleviates this problem, but 

20 actually improves binding and crossbinding capabiUty in certain instances (Sette et al. In: 
Persistent Viral Infections (Ahmed & Chen, eds., 1998)). Substitution of cysteine with 
gamma-amino butyric acid may occur at any residue of a peptide epitope, i.e., at either 
anchor or non-anchor positions. 

25 Expression Vectors and Construction of a Minigene 

The expression vectors of the invention contain at least one promoter 
element that is capable of expressing a transcription unit encoding the antigen of interest, 
for example, a MHC class I epitope or a MHC class 11 epitope and an MHC targeting 
sequence in the appropriate cells of an organism so that the antigen is expressed and 
30 targeted to the appropriate MHC molecule. For example, if the expression vector is 
administered to a mammal such as a human, a promoter element that functions in a 
human cell is incorporated into the expression vector. An example of an expression 
vector usefiil for expressing the MHC class 11 epitopes fiised to MHC class 11 targeting 
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sequences and the MHC class I epitopes described herein is the pEP2 vector described in 
Example IV. 

This invention relies on routine techniques in the field of recombinant 
genetics. Basic texts disclosing the general methods of use in this invention include 
5 Sambrook et ai. Molecular Cloning. A Laboratory Manual (2nd ed. 1 989); Kiiegler, 
Gene Transfer and Expression: A Laboratory Manual (1990); and Current Protocols in 
Molecular Biology (Ausubel et al., eds., 1994); Oligonucleotide Synthesis: A Practical 
Approach (Gait, ed., 1984); Kuijpers, Nucleic Acids Research 18(17):5197 (1994); 
Dueholm, J. Org. Chem. 59:5767-5773 (1994); Methods in Molecular Biology, volume 

10 20 (Agrawal, ed.); and Tijssen, Laboratory Techniques in Biochemistry and Molecular 
BiOiOgy—Hybridizaiion -with Nucleic Acid Probes, e.g., Fart I, chapter 2 "Overview of 
principles of hybridization and the strategy of nucleic acid probe assays" (1993)). 

The minigenes are comprised of two or many different epitopes {see, e.g.. 
Tables 1-8). The nucleic acid encoding the epitopes are assembled in a minigene 

1 5 according to standard techniques. In general, the nucleic acid sequences encoding 
minigene epitopes are isolated using amplification techniques with oligonucleotide 
primers, or are chemically synthesized. Recombinant cloning techniques can also be used 
when appropriate. Oligonucleotide sequences are selected which either amplify (when 
using PGR to assemble the minigene) or encode (when using synthetic oligonucleotides to 

20 assemble the minigene) the desired epitopes. 

Amplification techniques using primers are typically used to amplify and 
isolate sequences encoding the epitopes of choice from DNA or UNA {see U.S. Patents 
4,683,195 and 4,683,202; PCR Protocols: A Guide to Methods and Applications (Innis et 
al, eds, 1990)). Methods such as polymerase chain reaction (PGR) and hgase chain 

25 reaction (LCR) can be used to amplify epitope nucleic acid sequences directly from 

nxRNA, from cDNA, from genomic libraries or cDNA libraries. Restriction endonuclease 
sites can be incorporated into the primers. Minigenes amplified by the PGR reaction can 
be purified from agarose gels and cloned into an appropriate vector. 

Synthetic oHgonucleotides can also be used to construct minigenes. This 

30 method is performed using a series of overlapping oligonucleotides, representing both the 
sense and non-sense strands of the gene. These DNA fragments are then annealed, 
ligated and cloned. Oligonucleotides that are not commercially available can be 
chemically synthesized according to the soUd phase phosphoramidite triester method first 
described by Beaucage & Caruthers, Tetrahedron Letts. 22:1859-1862 (1981), using an 
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automated synthesizer, as described in Van Devanter et. al. Nucleic Acids Res. 12:6159- 
6168 (1984). Purification of oligonucleotides is by either native acrylamide gel 
electrophoresis or by anion-exchange HPLC as described in Pearson & Reanier, J. 
Chrom. 255:137-149(1983). 
5 The epitopes of the minigene are typically subcloned into an expression 

vector that contains a strong promoter to direct transcription, as well as other regulatory 
sequences such as enhancers and polyadenylation sites. Suitable promoters are well 
known in the art and described, e.g., in Sambrook et al. and Ausubel et al. Eukaryotic 
expression systems for mammalian cells are well known in the art and are commercially 

10 available. Such promoter elements include, for example, cytomegalovirus (CMV), Rous 
sarcoma virus L7R and SV40. 

The expression vector typically contains a transcription unit or expression 
. cassette that contains all the additional elements required for the expression of the 
minigene in host cells. A typical expression cassette thus contains a promoter operably 

15 linked to the minigene and signals required for efficient polyadenylation of the transcript. 
Additional elements of the cassette may include enhancers and introns with functional 
splice donor and acceptor sites. 

In addition to a promoter sequence, the expression cassette can also 
contain a transcriprion termination region downstream of the structural gene to provide 

20 for efficient termination. The termination region may be obtained fi-om the same gene as 
the promoter sequence or may be obtained from different genes. 

The particular expression vector used to transport the genetic information 
into the cell is not particularly critical. Any of the conventional vectors used for 
expression in eukar> otic cells may be used. Expression vectors containing regulatory 

25 elements fi^om eukaryotic viruses are typically used in eukaryotic expression vectors, e.g., 
SV40 vectors, papilloma virus vectors, and vectors derived fi-om Epstein Bar virus. Other 
exemplary eukaryotic vectors include pMSG, pAV009/A+, pMTO10/A+, pMAMneo-5, 
baculovirus pDSVE, and any other vector allowing expression of proteins under the 
direction of the SV40 early promoter, SV40 later promoter, metallothionein promoter, 

30 murine mammary tumor virus promoter, Rous sarcoma virus promoter, polyhedrin 

promoter, or other promoters shown effective for expression in eukaryotic cells. In one 
embodiment, the vector pEP2 is used in the present invention. 

Other elements that are typically included in expression vectors also 
include a repUcon that functions in E. coli, a gene encoding antibiotic resistance to permit 
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selection of bacteria that harbor recombinant plasmids, and unique restriction sites in 
nonessential regions of the plasmid to allow insertion of eiikaryotic sequences. The 
particular antibiotic resistance gene chosen is not critical, any of the many resistance 
genes known in the art are suitable. The prokaryotic sequences are preferably chosen 
5 such that they do not interfere with the replication of the DNA in eukaryotic cells, if 
necessary. 



Administration In Vivo 

The invention also provides methods for stimulating an immune response 

10 by administering an expression vector of the invention to an mdividual. Administration 
of an expression vector of the invention for stimulating an immune response is 
advantageous because the expression vectors of the invention target MHC epitopes to 
MHC molecules, thus increasing the number of CTL and HTL activated by the antigens 
encoded by the expression vector. 

15 Initially, the expression vectors of the invention are screened in mouse to 

determine the expression vectors having optimal activity in stimulating a desired immune 
response. Initial studies are therefore carried out, where possible, with mouse genes of 
the MHC targeting sequences. Methods of determining the activity of the expression 
vectors of the invention are well known in the art and include, for example, the uptake of 

20 ^H-thymidine to measure T cell activation and the release of ^'Cr to measure CTL activity 
as described below in Examples II and HE. Experiments similar to those described in 
Example IV are performed to determine the expression vectors having activity at 
stimulating an immune response. The expression vectors having activity are further 
tested in human. To circumvent potential adverse immunological responses to encoded 

25 mouse sequences, the expression vectors having activity are modified so that the MHC 
class n targeting sequences are derived from himian genes. For example, substitution of 
the analogous regions of the human horaologs of genes containing various MHC class II 
targeting sequences are substituted into the expression vectors of the invention. 
Examples of such human homologs of genes containing MHC class 11 targeting sequences 

30 are shown in Figures 12 to 17. Expression vectors containing human MHC class 11 

targeting sequences, such as those described in Example I below, are tested for activity at 
stimulating an immune response in human. 

The invention also relates to pharmaceutical compositions comprising a 
phaimaceutically acceptable carrier and an expression vector of the invention. 
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Pharmaceutically acceptable carriers are well known in the art and include aqueous or 
non-aqueous solutions, suspensions and emulsions, including physiologically buffered 
saline, alcohol/aqueous solutions or other solvents or vehicles such as glycols, glycerol, 
oils such as olive oil or injectable organic esters. 
5 A pharmaceutically acceptable carrier can contain physiologically 

acceptable compounds that act, for example, to stabilize the expression vector or increase 
the absorption of the expression vector. Such physiologically acceptable compounds 
include, for example, carbohydrates, such as glucose, sucrose or dextrans, antioxidants 
such as ascorbic acid or glutathione, chelating agents, low molecular weight polypeptides, 

10 antimicrobial agents, inert gases or other stabilizers or excipients. Expression vectors can 
additionally be complexed with other components such as peptides, polypeptides and 
carbohydrates. Expression vectors can also be complexed to particles or beads that can 
be administered to an individual, for example, using a vaccine gun. One skilled in the art 
would know that the choice of a pharmaceutically acceptable carrier, including a 

15 physiologically acceptable compound, depends, for example, on the route of 
administration of the expression vector. 

The invention further relates to methods of administering a pharmaceutical 
composition comprising an expression vector of the invention to stimulate an immune 
response. The expression vectors are administered by methods well known in the art as 

20 described in Donnelly et al. {Ann. Rev. Immunol. 15:617-648 (1997)); Feigner et al. (U.S. 
Patent No. 5,580,859, issued December 3, 1996); Feigner (U.S. Patent No. 5,703,055, 
issued December 30, 1997); and Carson et al. (U.S. Patent No. 5,679,647, issued October 
21, 1997), each of which is incorporated herein by reference. In one embodiment, the 
minigene is administered as naked nucleic acid. 

25 A pharmaceutical composition comprising an expression vector of the 

invention can be administered to stimulate an immune response in a subject by various 
routes including, for example, orally, intravaginally, rectally, or parenterally, such as 
intravenously, intramuscularly, subcutaneously, intraorbitally, intracapsularly, 
intraperitoneally, intracistemally or by passive or facilitated absorption through the skin 

30 using, for example, a skin patch or transdermal iontophoresis, respectively. Furthermore, 
the composition can be administered by injection, intubation or topically, the latter of 
which can be passive, for example, by direct application of an ointment or powder, or 
active, for example, using a nasal spray or inhalant. An expression vector also can be 
administered as a topical spray, in which case one component of the composition is an 
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appropriate propellant. The pharmaceutical composition also can be incorporated, if 
desired, into liposomes, microspheres or other polymer matrices (Feigner et al, U.S. 
Patent No. 5,703,055; Gregoriadis, Liposome Technology, Vols. I to III (2nd ed. 1993), 
each of which is incorporated herein by reference). Liposomes, for example, which 
5 consist of phospholipids or other lipids, are nontoxic, physiologically acceptable and 
metabolizable carriers that are relatively simple to make and administer. 

The expression vectors of the invention can be delivered to the interstitial 
spaces of tissues of an animal body (Feigner et al., U.S. Patent Nos. 5,580,859 and 
5,703,055). Administration of expression vectors of the invention to muscle is a 

1 0 particularly effective method of administration, including intradermal and subcutaneous 
injections and transdennal administration. Transdermal administration, such as by 
iontophoresis, is also an effective method to deHver expression vectors of the invention to 
muscle. Epidermal administration of expression vectors of the invention can also be 
employed- Epidermal administration involves mechanically or chemically irritating the 

1 5 outermost layer of epidermis to stimulate an immune response to the irritant (Carson et 
al., U.S. Patent No. 5,679,647). 

Other effective methods of administering an expression vector of the 
invention to stimulate an immune response include mucosal administration (Carson et al., 
U.S. Patent No. 5,679,647). For mucosal administration, the most effective method of 

20 administration includes intranasal administration of an appropriate aerosol containing the 
expression vector and a pharmaceutical composition. Suppositories and topical 
preparations are also effective for delivery of expression vectors to mucosal tissues of 
genital, vaginal and ocular sites. Additionally, expression vectors can be complexied to 
particles and administered by a vaccine gim. 

25 The dosage to be administered is dependent on the method of 

administration and will generally be between about 0. 1 ng up to about 200 ng. For 
example, the dosage can be from about 0.05 ng/kg to about 50 mg/kg, in particular about 
0.005-5 mg/kg. An effective dose can be determined, for example, by measuring the 
mmiune response after administration of an expression vector. For example, the 

30 production of antibodies specific for the MHC class II epitopes or MHC class I epitopes 
encoded by the expression vector can be measured by methods well known in the art, 
including ELISA or other immunological assays. In addition, the activation of T helper 
cells or a CTL response can be measured by methods well known in the art including, for 
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example, the uptake of ^H-thymidine to measure T cell activation and the release of ^'Cr 
to measure CTL activity (see Examples II and IE below). 

The pharmaceutical compositions comprising an expression vector of the 
invention can be administered to mammals, particularly humans, for prophylactic or 
5 therapeutic purposes. Examples of diseases that can be treated or prevented using the 
expression vectors of the invention include infection with HBV, HCV, HIV and CMV as 
well as prostate cancer, renal carcuioma, cervical carcinoma, lymphoma, condyloma 
acuminatum and acquired immunodeficiency syndrome (AIDS). 

In therapeutic applications, the expression vectors of the invention are 
10 administered to an individual already suffering from cancer, autoimmune disease or 

infected with a virus. Those in the incubation phase or acute phase of the disease can be 
treated with expression vectors of the invention, including those expressing all universal 
MHC class n epitopes, separately or in conjimction with other treatments, as appropriate. 

In therapeutic and prophylactic applications, pharmaceutical compositions 
15 comprising expression vectors of the invention are administered to a patient in an amount 
sufficient to elicit an effective immune response to an antigen and to ameliorate the signs 
or symptoms of a disease. The amount of expression vector to administer that is 
sufficient to ameliorate the signs or symptoms of a disease is termed a therapeutically 
effective dose. The amount of expression vector sufficient to achieve a therapeutically 
20 effective dose will depend on the pharmaceutical composition comprising an expression 
vector of the invention, the maimer of administration, the state and severity of the disease 
being treated, the weight and general state of health of the patient and the judgment of die 
prescribing physician. 

25 All publications and patent applications cited in this specification are 

herein incorporated by reference as if each individual publication or patent application 
were specifically and individually indicated to be incorporated by reference. 

Although the foregoing invention has been described in some detail by 
way of illustration and example for purposes of clarity of understanding, it will be readily 

30 apparent to one of ordinary skill in the art in light of the teachings of this invention that 
certain changes and modifications may be made thereto without departing fi-om the spirit 
or scope of the appended claims. 
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EXAMPLES 

The following example is provided by way of illustration only and not by 
way of limitation. Those of skill in the art will readily recognize a variety of noncritical 
parameters that could be changed or modified to yield essentially similar results. 

5 

EXAMPLE I: Construction of Expression Vectors Containing MHC Class II Epitopes 
This example shows construction of expression vectors containing MHC 

class n epitopes that can be used to target antigens to MHC class U molecules. 

Expression vectors comprising DNA constructs were prepared using 
1 0 overlapping oligonucleotides, polymerase chain reaction (PCR) and standard molecular 

biology techniques (Dieffenbach & Dveksler, PCR Primer: A Laboratory Manual (1995); 

Sambrook et al.. Molecular Cloning: A Laboratory Manual (2nd ed., 1989), each of 

which is incorporated herein by reference). 

To generate full length wild type li, the full length invariant chain was 
15 amplified, cloned, and sequenced and used in the construction of the three invariant chain 

constructs. Except where noted, the source of cDNA for all the constructs listed below 

was Mouse Spleen Marathon-Ready cDNAmade fi-om Balb/c males (Clontech; Palo Alto 

CA). The primer pairs were the oligonucleotide 

GCTAGCGCCGCCACCATGGATGACCAACGCGACCTC (SEQ ID NO:40), which is 
20 designated murli-F and contains an Nhel site followed by the consensus Kozak sequence 
and the 5' end of the li cDNA; and the oligonucleotide 

GGTACCTCACAGGGTGACTTGACCCAG (SEQ ID N0:41), which is designated 
murli-R and contains a Kpnl site and the 3' end of the li coding sequence. 

For the PCR reaction, 5 ^il of spleen cDNA and 250 nM of each primer 

25 were combined in a 100 yX reaction with 0.25 mM each dNTP and 2.5 units oiPfu 

polymerase in Pfu polymerase buffer containing 10 mM KCl, 10 mM (NH4)2S04, 20 mM 
Tris-chloride, pH 8.75, 2 mM MgS04, 0.1% TRITON X-100 and 100 ^g/ml bovine serum 
albumin (BSA). A Peridn/Ebner 9600 PCR machine (Perkin Ehner; Foster City CA) was 
used and the cychng conditions were: 1 cycle of 95°C for 5 minutes, followed by 30 

30 cycles of 95°C for 1 5 seconds, 52°C for 30 seconds, and 72°C for 1 minute. The PCR 
reaction was run on a 1% agarose gel, and the 670 base pair product was cut out, purified 
by spinning through a Miliipore Ultra£ree-MC filter (Millipore; Bedford MA) and cloned 
into pCR-Biunt firom Invitrogen (San Diego, CA). Individual clones were screened by 
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sequencing, and a correct clone (named bli#3) was used as a template for the helper 
constructs. 

DNA constructs containing pan DR epitope sequences and MHC II 
targeting sequences derived from the li protein were prepared. The li murine protein has 
been previously described (Zhu & Jones, Nucleic Acids Res. 17:447-448 (1989)), which is 
incorporated herein by reference. Briefly, the liPADRE construct contains the full length 
li sequence with PADRE precisely replacing the CLIP region. The DNA construct 
encodes amino acids 1 through 87 of invariant chain, followed with the 13 amino acid 
PADRE sequence (SEQ ID NO:38) and the rest of the invariant chain DNA sequence 
(amino acids 101-215). The construct was amplified in 2 overlapping halves that were 
joined to produce the final construct. The two primers used to amplify the 5* half were 
murli-F and the oligonucleotide 

CAGGGTCCAGGCAGCCACGAACTTGGCCACAGGTTTGGCAGA (SEQ ID 
NO:42), which is designated liPADRE-R. The liPADRE-R primer includes nucleotides 
303-262 of liPADRE. The 3' half was amplified with the primer 
GGCTGCCTGGACCCTGAAGGCTGCCGCTATGTCCATGGATAAC (SEQ ID 
NO:43), which is designated liPADRE-F and includes nucleotides 288-330 of liPADRE; 
and murli-R The PGR conditions were the same as described above, and the two halves 
were isolated by agarose gel electrophoresis as described above. 

Ten microliters of each PGR product was combined in a 100 \il PGR 
reaction with an annealing temperature of 50°C for five cycles to generate a Ml length 
template. Primers murli-F and murli-R were added and 25 more cycles carried out. The 
fiill length liPADRE product was isolated, cloned, and sequenced as described above. 
This construct contains the murine li gene with a pan DR epitope sequence substituted for 
the CLIP sequence of li (Figure 1). 

A DNA construct, designated I80T, containing the cytoplasmic domain, the 
transmembrane domain and part of the luminal domain of li fused to a string of multiple 
MHC class n epitopes was constructed (Figure 2). Briefly, the string of multiple MHC 
class n epitopes was constructed with three overiapping oligonucleotides (ohgos). Each 
oligo overlapped its neighbor by 15 nucleotides and the fmal MHC class E epitope string 
was assembled by extending the overlapping oligonucleotides in three sets of reactions 
using PGR The three oligonucleotides were: oligo 1, nucleotides 241-310, 

CTTCGCATGAAGCTTATCAGCCAGGCTGTGCACGCCGCTCACGCCGAAATCAA 
CGAAGCTGGAAGAACCC (SEQ ID NO:44); 
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oligo 2, nucleotides 364-295, 

TTCTGGTCAGCAGAAAGAACAGGATAGGAGCGTTTGGAGGGCGATAAGCTGG 
AGGGGTTCTTCCAGCTTC (SEQ ID NO:45); and 
oligo 3, nucleotides 350-42, 

TTCTGCTGACCAGAATCCTGACAATCCCCCAGTCCCTGGACGCCAAGTTCGTG 
GCTGCCTGGACCCTGAAG (SEQ ID NO:46). 

For the first PGR reaction. 5 ^tg of oligos 1 and 2 were combined in a 100 
ill reaction containing P> polymerase. A Perkin/Elmer 9600 PGR machine was used and 
the annealing temperature used was 45° C. The PGR product was gel-purified, and a 
second reaction containing the PGR product of oligos 1 and 2 with oligo 3 was annealed 
and extended for 10 cycles before gei purification of the fiill length product to be used as 
a "mega-primer." 

The I80T construct was made by amplifying bli#3 with murli-F and the 
mega-primer. The cycling conditions were: 1 cycle of 95°C for 5 minutes, followed by 5 
cycles of 95°C for 15 seconds, 37°G for 30 seconds, and 72°C for I minute. Primer Help- 
epR was added and an additional 25 cycles were carried out with the annealing 
temperature raised to 47°G. The Help-epR primer 

GGTAGCTCAAGCGGCAGGGTTCAGGGTGGAGGGA (SEQ ID NO:47) corresponds 
to nucleotides 438-405. The Ml length I80T product was isolated, cloned, and sequenced 



20 as above. 



The I80T construct (Figure 2) encodes amino acid residues I through 80 of 
n, containing the cytoplasmic domain, the transmembrane domain and part of the luminal 
domain, fiised to a string of multiple MHC class U epitopes coiresponding to: amino acid 
residues 323-339 of ovalbumin 

aieSerGhiAlaValHisAlaAlaHisAlaGluIleAsnGluAlaGlyArg; SEQ ID NO:48); amino 
acid residues 128 to 141 of HBV core antigen (amino acids 

ThrProProAlaTyrArgProProAsnAlaProIleLeu; SEQ ED NO:49); amino acid residues 182 
to 196 of HBV env (amino acids PhePheLeuLeuThrArgneLeuThrlleProGtaSerLeuAsp; 
SEQ ID NO:50); and the pan DR sequence designated SEQ ID NO:38. 

A DNA construct containing the cytoplasmic domain, transmembrane 
domain and a portion of the luminal domain of li fiised to the MHC class U epitope string 
shown in Figure 2 and amino acid residues 101 to 215 of li encoding the trimerization 
region of U was generated (Figure 3). This constmct, designated EThfiill, encodes the 
first 80 amino acids of invariant chain followed by the MHC class R epitope string 
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(replacing CLIP) and the rest of the invariant chain (amino acids 101-215). Briefly, the 
construct was generated as two overlapping halves that were armealed and extended by 
PCR to yield the final product 

The 5 " end of liThfiill was made by amplifying I80T with murli-F (SEQ 
ID NO:40) and Th-Pad-R. The Th-Pad-R primer AGCGGCAGCCTTCAGGGTC (SEQ 
ID N0:51) corresponds to nucleotides 429-411. The 3' half was made by amplifying 
bli#3 with liPADRE-F and murli-R (SEQ ID N0:41). The liPADRE-F primer 
GGCTGCCTGGACCCTGAAGGCTGCCGCTATGTCCATGGATAAC (SEQ ID NO:52) 
corresponds to nucleotides 402-444. Each PGR product was gel purified and mixed, then 
denatured, annealed, and extended by five cycles of PCR. Primers murli-F (SEQ ID 
NO:40) and murli-R (SEQ ID N0:41) were added and another 25 cycles performed. The 
full length product was gel purified, cloned, and sequenced. 

All of the remaining constructs described below were made essentially 
according to the scheme shown in Figure 18. Briefly, primer pairs IF plus IR, designated 
below for each specific construct, were used to amplify the specific signal sequence and 
contained an overlapping 15 base pair tail identical to the 5' end of the MHC class 11 
epitope string. Primer pair Th-ova-F, ATCAGCCAGGCTGTGCACGC (SEQ ID NO:53), ' 
plus Th-Pad-R (SEQ ID N0:51) were used to amplify the MHC class II epitope string. A 
15 base pair overlap and the specific transmembrane and cytoplasmic tail containing the 
targeting signals were amplified with primer pairs 2F plus 2R. 

All three pieces of each cDNA were amplified using the following 
conditions: 1 cycle of 95°C for 5 minutes, followed by 30 cycles of 95°C for 15 seconds, 
52°C for. 30 seconds, and 72°C for 1 minute. Each of the three firagments was agrose-gel 
purified, and the signal sequence and MHC class H string fi-agments were combined and 
joined by five cycles in a second PCR. After five cycles, primers IF and Th-Pad-R were 
added for 25 additional cycles and the PCR product was gel purified. This signal 
sequence plus MHC class II epitope string firagment was combined with the 
transmembrane plus cytoplasmic tail fi-agment for the final PCR. After five cycles, 
primers IF plus 2R were added for 25 additional cycles and the product was gel purified, 
cloned and sequenced. 

A DNA construct containing the murine immunoglobulin kappa signal 
sequence fused to the T helper epitope string shovra in Figure 2 and the transmembrane 
and cytoplasmic domains of T,AMP-1 was generated (Figure 4) (Granger et a!.. J. Biol. 
Chem. 265:12036-12043 (1990)), which is incorporated by reference (mouse LAMP-1 
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GenBank accession No. M32015). This construct, designated kappaLAMP-Th, contains 
the consensus mouse immunoglobulin kappa signal sequence and was amplified fi-om a 
plasmid containing full length immunoglobulin kappa as depicted in Figure 1 8. The 
primer IF used was the oligonucleotide designated KappaSig-F, 
GCTAGCGCCGCCACCATGGGAATGCAG (SEQ ID NO:54). 

The primer IR used was the oUgonucleotide designated Kappa-Th-R, 
CACAGCCTGGCTGATTCCTCTGGACCC (SEQ ID NO:55). 

The primer 2F used was the oUgonucleotide designated PAD/LAMP-F, 
CTGAAGGCTGCCGCTAACAACATGTTGATCCCC (SEQ ID NO:56). The primer 2R 
used was the oligonucleotide designated LAMP-CYTOR, 
GGTACCCTAGATGGTCTGATAGCC (SEQ ID NO:57). 

A DNA construct containing the signal sequence of H2-M fused to the 
MHC class II epitope string shown in Figure 2 and the transmembrane and cytoplasmic 
domains of H2-M was generated (Figure 5). The mouse H2-M gene has been described 
previously, Peleraux et al., Immunogenetics 43:204-214 (1996)), which is incorporated 
herein by reference. This construct was designated H2M-Th and was constructed as 
depicted in Figure 18. The primer IF used was the oligonucleotide designated H2-Mb- 
IF, GCC GCT AGC GCC GCC ACC ATG GCT GCA CTC TGG (SEQ ID NO:58). The 
primer IR used was the oligonucleotide designated H2-Mb-1R, CAC AGC CTG GCT 
GAT CCC CAT ACA GTG CAG (SEQ ID NO:59). The primer 2F used was the 
oligonucleotide designated H2-Mb-2F, CTG AAG GCT GCC GCT AAG GTC TCT GTG 
TCT (SEQ ID NO:60). The primer 2R used was the oligonucleotide designated H2-Mb- 
2R, GCG GGT ACC CTA ATG CCG TCC TTC (SEQ ID N0:6 1). 

A DNA construct containing the signal sequence of H2-D0 fused to the 
MHC class II epitope string shown in Figure 2 and the transmembrane and cj^oplasmic 
domains of H2-D0 was generated (Figure 6). The mouse H2-DO gene has been 
described previously (Larhammar et al, J. Biol. Chem. 260:14111-14119 (1985)), which 
is incorporated herein by reference (GenBank accession No. Ml 9423). This construct, 
designated H20-Th, was constructed as depicted in Figure 18. The primer IF used was 
the oligonucleotide designated H2-0b-lF, GCG GCT AGC GCC GCC ACC ATG GGC 
GCT GGG AGG (SEQ ID NO:62). The primer IR used was the oligonucleotide 
designated H2-0b-lR, TGC ACA GCC TGG CTG ATG GAATCC AGC CTC (SEQ ID 
NO:63). The primet 2F used was the oligonucleotide designated H2-Ob-2F, CTG AAG 
GCT GCC GCT ATA CTG AGT GGA GCT (SEQ ID NO:64). The primer 2R used was 
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the oligonucleotide designated H2-Ob-2R, GCC GGT ACC TCATGT GAG ATG TCC 
CG (SEQIDNO:65). 

A DNA construct containing a pan DR epitope sequence (SEQ ID NO:38) 
fused to the amino-terminus of influenza matrix protein is generated (Figure 7). This 
5 construct, designated PADRE-Influenza matrix, contains the universal MHC class 11 
epitope PADRE attached to the amino terminus of the influenza matrix coding sequence. 
The construct is made using a long primer on the 5' end primer. The 5' primer is the 
oligonucleotide 

GCTAGCGCCGCCACCATGGCCAAGTTCGTGGCTGCCTGGACCCTGAAGGCTGC 
1 0 CGCTATGAGTCTTCTAACCGAGGTCGA (SEQ ID NO:66). The 3 ' primer is the 
oligonucleotide TCACTTGAATCGCTGCATCTGCACCCCCAT (SEQ ID NO:67). 
Influenza virus from the America Type Tissue Collection (ATCC) is used as a source for 
the matrix coding region (Perdue et al. Science 279:393-396 (1998)), which is 
incorporated herein by reference (GenBank accession No. AF036358). 
15 A DNA construct containing a pan DR epitope sequence (SEQ ID NO: 3 8) 

fused to the amino-terminus of HBV-S antigen was generated (Figure 8). This construct 
is designated PADRE-HBV-s and was generated by annealing two overlapping 
oligonucleotides to add PADRE onto the amino terminus of hepatitis B surface antigen 
(Michel et al.. Proc. Natl. Acad. Sci. USA 81:7708-7712 (1984); Michel et al, Proa. Natl. 
20 Acad. Sci. USA 92:5307-53 1 1 (1995)), each of which is incorporated herein by reference. 
One oligonucleotide was 

GCTAGCGCCGCCACCATGGCCAAGTTCGTGGCTGCCTGGACCCTGAAGGCTGC 
CGCTC (SEQ ID NO:68). The second oligonucleotide was 

CTCGAGAGCGGCAGCCTTCAGGGTCCAGGCAGCCACGAACTTGGCCATGGTG 
25 GCGGCG (SEQ ID NO:69). When annealed, the oligos have Nhel and Xhol cohesive 
ends. The oUgos were heated to 100°C and slowly cooled to room temperature to anneal. 
A three part ligation joined PADRE with an Xhol-Kpnl fragment containing HBV-s 
antigen into the Nhel pliis Kpnl sites of the expression vector. 

A DNA construct containing the signal sequence of Ig-a fused to the MHC 
30 class n epitope string shown in Figure 2 and the transmembrane and cytoplasmic domains 
of Ig-a was generated (Figure 9). The mouse Ig-a gene has been described previously 
(Kashiwamura et al., J. Immunol. 145:337-343 (1990)), which is incorporated herein by 
reference (GenBank accession No. M31773). This construct, designated Ig-alphaTh, was 
constructed as depicted in Figure 18. The primer IF used was the oligonucleotide 
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designated Ig alpha-lF, GCG GCT AGC GCC GCC ACC ATG CCA GGG GGT CTA 
(SEQ ID NO:70). The primer IR used was the oUgonucIeotide designated Igalpha-IR, 
GCA CAG CCT GGG TGA TGG CCT GGC ATC CGG (SEQ ED N0:71). The primer 2F 
used was the oligonucleotide designated Igalpha-2F, CTG AAG GCT GCC GCT GGG 
5 ATC ATC TTG CTG (SEQ ID NO:72). The primer 2R used was the oligonucleotide 
designated Igalpha-2R, GCG GGT ACC TCATGG CTT TTC CAG CTG (SEQ ID 
NO:73). 

A DNA construct containing the signal sequence of Ig-P fused to the MHC 
class II string shown in Figure 2 and the transmembrane and cytoplasmic domains of IgP 
1 0 was generated (Figure 1 0). The Ig-p sequence is the B29 gene of mouse and has been 
described previously (Hermanson et ai, Proc. Natl. Acad. Sci. USA 85:6890-6894 
(1988)), which is incorporated herein by reference (GenBank accession No. JOS 857). 
This construct, designated Ig-betaTh, was constructed as depicted in Figure 18. The 
primer IF used was the oligonucleotide designated B29-1F (33mer) GCG GCT AGC 
15 GCC GCC ACC ATG GCC ACA CTG GTG (SEQ ID NO:74). The primer IR used was 
the oligonucleotide designated B29-1R (30mer) CAC AGC CTG GCT GAT CGG CTC 
ACC TGA GAA (SEQ ID NO:75). The primer 2F used was the oligonucleotide 
designated B292F (30mer) CTG AAG GCT GCC GCT ATT ATC TTG ATC CAG (SEQ 
ID NO: 76). The primer 2R used was the oligonucleotide designated B29-2R (27mer), 
20 GCC GGT ACC TCA TTC CTG GCC TGG ATG (SEQ ID NO:77). 

A DNA construct containing the signal sequence of the kappa 
immunoglobulin signal sequence fused to the MHC class II epitope string shown in 
Figure 2 was constructed (Figure 1 1). This construct is designated SigTh and was 
generated by using the kappaLAMP-Th construct (shown in Figure 4) and amplifying 
25 with the primer pair KappaSig-F (SEQ ED NO:54) plus Help-epR (SEQ ID NO:47) to 
create SigTh. SigTh contains the kappa immunoglobulin signal sequence fused to the T 
helper epitope string and terminated with a translational stop codon. 

Constructs encoding human sequences corresponding to the above 
described constructs having mouse sequences are prepared by substituting human 
30 sequences for the mouse sequences. Briefly, for the liPADRE construct, corresponding to 
Figure 1 , amino acid residues 1-80 from the human li gene HLA-DR sequence (Figure 
12) (GenBanlc accession No. X00497 M14765) is substituted for the mouse li sequences, 
which is fused to PADRE, followed by human invariant chain HLA-DR amino acid 
residues 1 14-223. For die I80T construct, corresponding to Figure 2, amino acid residues 
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1-80 from the human sequence of li is followed by a MHC class U epitope string. For the 
liThfuU construct, corresponding to Figure 3, amino acid residues 1-80 from the human 
sequence of li, which is fused to a MHC class 11 epitope string, is followed by human 
invariant chain amino acid residues 114-223. 

For the LAMP-Th construct, similar to Figure 4, the signal sequence 
encoded by amino acid residues 1-19 (nucleotides 11-67) of human LAMP-1 (Figure 13) 
(GenBank accession No. J04182), which is fused to the MHC class 11 epitope string, is 
followed by the transmembrane (nucleotides 1163-1213) and cytoplasmic tail 
(nucleotides 1214-1258) region encoded by amino acid residues 380-416 of human 
LAMP-1. 

For the HLA-DM-Th construct, corresponding to Figure 5, the signal 
sequence encoded by amino acid residues 1-17 (nucleotides 1-51) of human HLA-DMB 
(Figure 14) (GenBank accession No. U15085), which is fused to the MHC class II epitope 
string, is followed by the transmembrane (nucleotides 646-720) and cytoplasmic tail 
(nucleotides 721-792) region encoded by amino acid residues 216-263 of human HLA- 
DMB. 

For the HLA-DO-Th construct, corresponding to Figure 6, the signal 
sequence encoded by amino acid residues 1-21 (nucleotides 1-63) of human HLA-DO 
(Figure 15) (GenBank accession No. L29472 J02736 N00052), which is fused to the 
MHC class II epitope string, is followed by the transmembrane (nucleotides 685-735) and 
cytoplasmic tail (nucleotides 736-819) region encoded by amino acid residues 223-273 of 
human HLA-DO. 

For the Ig-alphaTh construct, corresponding to Figure 9, the signal 
sequence encoded by amino acid residues 1-29 (nucleotides 1-87) of human Ig-a MB-1 
(Figure 16) (GenBank accession No. U05259), which is fused to the MHC class 11 epitope 
string, is followed by the transmembrane (nucleotides 424-498) and cytoplasmic tail 
(nucleotides 499-678) region encoded by amino acid residues 142-226 of human Ig-a 
MB-1. 

For the Ig-betaTh construct, corresponding to Figure 10, the signal 
sequence encoded by amino acid residues 1-28 (nucleotides 17-100) of human Ig-p B29 
(Figure 17) (GenBank accession No. M80461), which is fused to the MHC class II 
epitope string, is followed by the transmembrane (nucleotides 500-547) and cytoplasmic 
tail (nucleotides 548-703) region encoded by amino acid residues 156-229 of human Ig-p. 
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The SigTh construct shown in Figure 1 1 can be used in mouse and human. 
Alternatively, a signal sequence derived from an appropriate human gene containing a 
signal sequence can be substituted for the mouse kappa immunoglobulin sequence in the 
Sig Th construct. 

The PADRE-Influenza matrix construct shoAvn in Figure 7 and the 
PADRE-HBVs construct shown in Figure 8 can be used in mouse and human. 

Some of the DNA constructs described above were cloned into the vector 
pEP2 (Figure 19; SEQ ID NO:35). The pEP2 vector was constructed to contain dual 
CMV promoters. The pEP2 vector used the backbone of pcDNA3. l(-)Myc-His A from 
Invitrogen and pERESlhyg from Clontech. Changes were made to both vectors before the 
CMV transcription unit from pIRESlhyg was moved into the modified pcDNA vector. 

The pcDNA3.1(-)Myc-His A vector (http://wavw.invitrogen.com) was 
modified. Briefly, the PvuU fragment (nucleotides 1342-3508) was deleted. ABspHI 
fragment that contains the Ampicillin resistance gene (nucleotides 4404-5412) was cut 
out. The Ampicillin resistance gene was replaced with the kanamycin resistance gene 
from pUC4K (GenBank Accession #X06404). pUC4K was amplified with the primer set: 
TCTGATGTTACATTGCACAAG (SEQ ID NO:78) (nucleotides 1621-1601) and 
GCGCACTCATGATGCTCTGCCAGTGTTACAACC (SEQ ID NO:79) (nucleotides 
682-702 plus the addition of a BspHI restriction site on the 5' end). The PCR product 
was digested with BspHI and ligated into the vector digested with BspHI. The region 
between the Pmel site at nucleotide 905 and the EcoRV site at nucleotide 947 was 
deleted. The vector was then digested with Pmel (cuts at nucleotide 1076) and Apal (cuts 
at nucleotide 1004), Klenow filled in at die cohesive ends and ligated. The Kpnl site at 
nucleotide 994 was deleted by digesting with Kpnl and filling in the ends with Klenow 
DNA polymerase, and ligating. The intron A sequence from CMV (GenBank accession 
M2I295, nucleotides 635-1461) was added by amplifying CMV DNA with the primer set: 
GCGTCTAGAGTAAGTACCGCCTATAGACTC (SEQ ID NO: 80) (nucleotides 635-655 
plus an Xbal site on the 5' end) and CCGGCTAGCCTGCAGAAAAGACCCATGGAA 
(SEQ ID N0:81) (nucleotides 1461-1441 plus an Nhel site on the 3' end). The PCR 
product was digested with Xbal and Nhel and ligated into the Nhel site of the vector 
(nucleotide 895 of the original pcDNA vector) so that the Nhel site was on the 3' end of 
the intron. 

To modify the pIRESlhyg vector (GenBank Accession U89672, 
Clontech), the Kpnl site (nucleotide 91 1) was deleted by cutting and filling in with 
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Klenow. The plasmid was cut with NotI (nucleotide 1254) and Xbal (nucleotide 3196) 
and a polylinker oligo was inserted into the site. The polylinker was formed by annealing 
the following two oligos: 

GGCCGCAAGGAAAAAATCTAGAGTCGGCCATAGACTAATGCCGGTACCG (SEQ 
5 ID NO:82) and 

CTAGCGGTACCGGCATTAGTCTATGGCCCGACTCTAGATTTTTTCCTTGC(SEQ 
ID NO:83). The resulting plasmid was cut with Hindi and the fragment between HincII 
sites 234 and 3538 was isolated and hgated into the modified pcDNA vector. This 
fragment contains a CMV promoter, intron, polylinker, and polyadenylation signal. 

10 The pIREShyg piece and the pcDNA piece were combmed to form pEP2. 

The modified pcDNA3. 1 (-)Myc-His A vector was partially digested with PvuH to isolate 
a linear fragment with the cut downstream of the pcDNA polyadenylation signal (the 
other PvuII site is the CMV intron). The HincII fragment from the modified pIRESlhyg 
vector was hgated into the PvuII cut vector. The polyadenylation signal from the pcDNA 

1 5 derived transcription unit was deleted by digesting with EcoRI (pcDNA nucleotide 95 5) 
and Xhol (pIRESlhyg nucleotide 3472) and replaced with a synthetic polyadenylation 
sequence. The synthetic polyadenylation signal was described in Levitt et al.. Genes and 
Development 3:1019-1025 (1989)). 

Two oligos were annealed to produce a fragment that contained a 
20 polylinker and polyadenylation signal with EcoRI and Xhol cohesive ends. The oligos 
were: 

AArTCGGATATCCAAGCTTGATGAATAAAAGATCAGAGCTCTAGTGATCTGTGT 
GTTGGTTTTTTTGTGTGC (SEQ ID NO:84) and 

TCGAGCACACAAAAAACCAACACACAGATCACTAGAGCTCTGATCTTTTTATT 

25 CATCAAGCTTGGATATCCG (SEQ ID NO:85). 

The resulting vector is named pEP2 and contauis two separate 
transcription units. Both transcription units use the same CMV promoter but each 
contains different intron, polylinker, and polyadenylation sequences. 

The pEP2 vector contains two transcription units. The first transcription 

30 unit contains the CMV promoter initially from pcDNA (nucleotides 210-862 in Figure 
19), CMV intron A sequence (nucleotides 900-1728 in Figure 19), polylinker cloning site 
(nucleotides 1740-1760 in Figure 19) and synthetic polyadenylation signal (nucleotides 
1764-1769 in Figure 19). The second transcription unit, which was inirially derived from 
pIRESlhyg, contains the CMV promoter (nucleotides 3165-2493 in Figure 19). intron 
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sequence (nucleotides 2464-2173 in Figure 19), polylinker clone site (nucleotides 2126- 
2095 in Figarc 19) and bovine growth hormone polyadenylation signal (nucleotides 1979- 
1974 in Figure 19). The kanamycin resistance gene is encoded in nucleotides 4965-4061 
(Figure 19). 

5 The DNA constructs described above were digested with Nhel and Kpnl 

and cloned into the Xbal and Kpnl sites of pEP2 (the second transcription unit). 

Additional vectors were also constructed. To test for the effect of co- 
expression of MHC class I epitopes with MHC class II epitopes, an insert was generated, 
designated AOS, that contains nine MHC class I epitopes. The AOS insert was initially 
10 constructed in the vector pMIN.O (Figure 20; SEQ ID NO:36). Briefly, the AOS insert 
contains nine MHC class I epitopes, six restricted by HLA-A2 and three restricted by 
HLA-Al 1, and the universal MHC class H epitope PADRE. The vector pMIN.O contains 
epitopes from HBV, HIV and a mouse ovalbumin epitope. The MHC class I epitopes 
appear in pMIN.O in the following order: 
^ ^ consensus mouse Ig Kappa signal sequence (pMIN.O amino acid residues 

1-20, nucleotides 16-81) MQVQIQSLFLLLLWVPGSRG (SEQ ID NO:86) encoded by 
nucleotides ATG CAG GTG CAG ATC CAG AGC CTG TTT CTG CTC CTC CTG TGG 
GTG CCC GGG TCC AGA GGA (SEQ ID NO:87); 

HBV pol 149-159 (All restricted) 
20 (pMIN.O amino acid residues 21-31, nucleotides 82-1 14) 

HTLWKAGILYK (SEQ ID NO:88) encoded by nucleotides CAC ACC CTG TGG AAG 
GCC GGA ATC CTG TAT AAG (SEQ ID NO:89); 

PADRE-universal MHC class U epitope (pMIN.O amino acid residues 32- 
45, nucleotides 1 15-153) AKFVAAWTLKAAA (SEQ ID NO:38) encoded by nucleotides 
25 GCC AAG TTC GTG GCT GCC TGG ACC CTG AAG GCT GCC GCT (SEQ ID 
NO:90); 

HBV core 18-27 (A2 restricted) (pMIN.O amino acid residues 46-55, 
nucleotides 154-183) FLPSDFFPSV (SEQ ID N0:91) encoded by nucleotides TTC CTG 
CCT AGC GAT TTC TTT CCT AGC GTG (SEQ ID N0:92); 
30 HTV env 120-128 (A2 restricted) (pMIN.O amino acid residues 56-64, 

nucleotides 184-210) KLTPLCVTL (SEQ ID NO:93) encoded by nucleotides AAG CTG 
ACC CCA CTG TGC GTG ACC CTG (SEQ ID NO:94); 
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HBV pol 551-559 (A2 restricted) (pMIN.O amino acid residues 65-73, 
nucleotides 21 1-237) YMDDWLGA (SEQ ID NO:95) encoded by nucleotides TAT ATG 
GAT GAC GTG GTG CTG GGA GCC (SEQ ID NO:96); 

mouse ovalbumin 257-264 (K*" restricted) (pMIN.O amino acid residues 
74-81, nucleotides 238-261) SIINFEKL (SEQ ED NO:97) encoded by nucleotides AGC 
ATC ATC AAC TTC GAG AAG CTG (SEQ ID NO:98); 

HBV pol 455-463 (A2 restricted) (pMIN.O amino acid residues 82-90, 
nucleotides 262-288) GLSRYVARL (SEQ ID NO:99) encoded by nucleotides GGA CTG 
TCC AGATAC GTG GCT AGG CTG (SEQ ID NO:100); 

HTV pol 476-84 (A2 restricted) (pMIN.O amino acid residues 91-99, 
nucleotides 289-315) ILKEPVHGV (SEQ ID NO:101) encoded by nucleotides ATC CTG 
AAG GAG CCT GTG CAC GGC GTG (SEQ ID NO: 102); 

HBV core 141-151 (All restricted) 

(pMIN.O amino acid residues 100-110, nucleotides 316-348) 
STLPETTWRR (SEQ ID NO: 103) encoded by nucleotides TCC ACC CTG CCA GAG 
ACC ACC GTG GTG AGG AGA (SEQ ID NO: 104); 

mV env 49-58 (All restricted) (pMIN.O amino acid residues 111-120, 
nucleotides 349-378) TVYYGVPVWK (SEQ ID NO: 105) encoded by nucleotides ACC 
GTG TAG TAT GGA GTG CCT GTG TGG AAG (SEQ ID NO: 106); and 

HBV env 335-343 (A2 restricted) (pMIN.O amino acid residues 121-129, 
nucleotides 378-405) WLSLLVPFV (SEQ ED NO: 107) encoded by nucleotides TGG 
CTG AGC CTG CTG GTG CCC TTT GTG (SEQ ID NO: 108). 

The pMIN.O vector contains a Kpnl restriction site (pMIN.O nucleotides 
406-411) and a Nhel restriction site (pMIN.O nucleotides 1-6). The pMIN.O vector 
contains a consensus Kozak sequence (nucleotides 7-18) (GCCGCCACCATG; SEQ ED 
NO: 109) and murine Kappa Ig-light chain signal sequence foUow^ed by a string of 10 
MHC class I epitopes and one universal MHC class 11 epitope. The pMIN.O sequence 
encodes an open reading frame fused to the Myc and His antibody epitope tag coded for 
by the pcDNA 3.1 Myc-His vector. The pMIN.O vector w^as constructed with eight 
oligonucleotides: 

Mini oUgo 

GAGGAGCAGAAACAGGCTCTGGATCTGCACCTGCATTCCCATGGTGGCGGCGC 
TAGCAAGCTTCTTGCGC (SEQ ID NO: 1 10); 
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Min2 oligo 

CCTGTTTCTGCTCCTCCTGTGGGTGCCCGGGTCCAGAGGACACACCCTGTGGA 
AGGCCGGAATCCTGTATA (SEQ ID NO: 1 1 1 ); 
Min3 oligo 

TCGCTAGGCAGGAAAGCGGCAGCCTTCAGGGTCCAGGCAGCCACGAACTTGG 
CCTTATACAGGATTCCGG (SEQ ID NO: 112); 
Min4 oligo 

CTTTCCTGCCTAGCGATTTCTTTCCTAGCGTGAAGCTGACCCCACTGTGCGTGA 
CCCTGTATATGGATGAC (SEQ ID NO: 113); 
Min5 oligo 

CGTACCTGGACAGTCCCAGCTTCTCGAAGTTGATGATGCTGGCT 
CCCAGCACCACGTCATCCATATACAG (SEQ ID NO: 1 14); 
Min6 oligo 

GGACTGTCCAGATACGTGGCTAGGCTGATCCTGAAGGAGCCTGTGCACGGCGT 
GTCCACCCTGCCAGAGAC (SEQ ID NO: 11 5); 
Min7 oligo 

GCTCAGCCACTTCCACACAGGCACTCCATAGTACACGGTCCTCCTCACCACGG 
TGGTCTCTGGCAGGGTG (SEQ ID NO: 11 6); 
Min8 oligo 

GTGGAAGTGGCTGAGCCTGCTGGTGCCCTTTGTGGGTACCTGATCTAGAGC 
(SEQ ID NO: 11 7). 

Additional primers were flanking primer 5', GCG CAA GAA GCT TGC 
TAG CG (SEQ ID N0:U8) and flanking primer 3 ', GCT CTA GAT CAG GTA CCC 
CAC (SEQ ID NO: 119). 

The original pMIN.O minigene construction was carried out using eight 
overlapping oligos averaging approximately 70 nucleotides in length, which were 
synthesized and HPLC purified by Operon Technologies Inc. Each oligo overlapped its 
neighbor by 15 nucleotides, and the final multi-epitope minigene was assembled by 
extending the overlapping oligos in three sets of reactions using PGR (Ho et ai. Gene 
77:51-59(1989). 

For the fu^t PGR reaction, 5 |ig of each of two oligos were annealed and 
extended: 1+2, 3+4, 5+6, and 7+8 were combined in 100 \l\ reactions containing 0.25 mM 
each dNTP and 2.5 units of Pfli polymerase in Pfu polymerase buffer containing 10 mM 
KCI, 10 mM (NH4)2S04, 20 mM Tris-chloride, pH 8.75, 2 mM MgS04, 0.1% TRITON 
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X-100 and 100 mg/ml BSA. A Perkin/^lmer 9600 PGR machine was used and the 
annealing temperature used was 5°C below the lowest calculated T™ of each primer pair. 
The full length dimer products were gel-purified, and two reactions containing the 
product of 1-2 and 3-4, and the product of 5-6 and 7-8 were mixed, annealed and 
extended for 10 cycles. Half of the two reactions were then mixed, and 5 cycles of 
anneaUng and extension carried out before flanking primers were added to amplify the 
full length product for 25 additional cycles. The fiill length product was gel purified and 
cloned into pCR-blunt (Invitrogen) and individual clones were screened by sequencing. 
The Min insert was isolated as an Nhel-Kpnl fi-agment and cloned into the same sites of 
pcDNA3.I(-)/Myc-His A (Invitrogen) for expression. The Min protein contains the Myc 
and Kis aniibody epitope tags at its carboxyl-terminal end. 

For all the PGR reactions described, a total of 30 cycles were performed 
using Pfu polymerase and the following conditions: 95°C for 15 seconds, annealing 
temperature for 30 seconds, 72°C for one minute. The anneahng temperature used was 
5°C below the lowest calculated Tm of each primer pair. 

Three changes to pMIN.O were made to produce pMIN.l (Figure 21; SEQ 
ID NO:37, also refen-ed to as pMIN-AOS). The mouse ova epitope was removed, the 
position 9 alanine anchor residue (#547) of HBV pol 551-560 was converted to a valine 
which increased the in vitro binding affinity 40-fold, and a translational stop codon was 
introduced at the end of the multi-epitope coding sequence. The changes were made by • 
amplifying two overiapping fi-agments and combining them to yield the full length 
product. 

The first reaction used the 5' pcDNA vector primer T7 and the primer Min- 
ovaR (nucleotides 247-218) TGGACAGTCCCACTCCCAGCACCACGTCAT (SEQ ID 
NO: 120). The 3' half was amplified with the primers: Min-ovaF (nucleotides 228-257) 
GCTGGGAGTGGGACTGTCGAGGTACGTGGC (SEQ ID N0:121) and Min-StopR 
(nucleotides 390-361) GGTACCTCACACAAAGGGCACCAGCAGGC (SEQ ID 
NO: 122) 

The two firagments were gel purified, mixed, denatured, annealed, and 
filled in with five cycles of PGR. The fiill length fi-agment was amplified with the 
flanking primers T7 and Min-Stop for 25 more cycles. The product was gel purified, 
digested with Nhel and Kpnl and cloned into pcDNA3.I for sequencing and expression. 
The insert fiom pMin.l was isolated as an Nhel-Kpnl fi-agment and cloned mto pEP2 to 
make pEP2-A0S. 
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EXAMPLE II: Assay for T Helper Cell Activation 

This example shows methods for assaying T helper cell activity. One 
method for assaying T helper cell activity uses spleen cells of an immunized organism. 
Briefly, a spleen cell pellet is suspended with 2-3 ml of red blood cell lysis buffer 
containing 8.3 g/liter ammonium chloride in 0.001 M Tris-HCl, pH 7.5. The cells are 
incubated in lysis buffer for 3-5 min at room temperature with occasional vortexing. An 
excess volume of 50 ml of RIO medium is added to the cells, and the cells are pelleted. 
The cells are resuspended and pelleted one or two more times in R2 medium or RIO 
medium. 

The cell pellet is suspended in RIO medium and counted. If the cell 
suspension is aggregated, the aggregates are removed by filtration or by allowing the 
aggregates to settle by gravity. The cell concentration is brought to lOVml, and 100 nl of 
spleen cells are added to 96 well flat bottom plates. 

Dilutions of the appropriate peptide, such as pan DR epitope (SEQ ID 
NO:145), are prepared in RIO medium at 100, 10, 1, 0.1 and 0.01 ^ig/ml, and 100 |il of 
peptide are added to duplicate or triplicate wells of spleen cells. The final peptide 
concentration is 50, 5, 0.5, 0.05 and 0.005 ng/ml. Control wells receive 100 nl RIO 
medium. 

The plates are incubated for 3 days at 37°C. After 3 days, 20 ^1 of 
50 nCi/ml •'H-thymidine is added per well. Cells are incubated for 18-24 hours and then 
harvested onto glass fiber filters. The incorporation of ^H-thymidine into DNA of 
proliferating cells is measured in a beta counter. 

A second assay for T helper cell activity uses peripheral blood 
mononuclear cells (PBMC) that are stimulated in vitro as described in Alexander et ai, 
supra and Sette (WO 95/07,707), as adapted fi-om Manca et al, J. Immunol. 146: 1964- 
1971 (1991), which is incorporated herein by reference. Briefly, PBMC are collected 
fi-om healthy donors and purified over Ficoll-Plaque (Pharmacia Biotech; Piscataway, 
NJ). PBMC are plated in a 24 well tissue culture plate at 4 x 10* cells/ml. Peptides are 
added at a final concentration of 10 jxg/ml. Cultures are incubated at 37°C in 5% CO2. 

On day 4, recombinant interleukin-2 (IL-2) is added at a final 
concentration of 10 ng/ml. Cultures are fed every 3 days by aspirating 1 ml of medium 
and replacing with fi-esh medium containbg IL-2. Two additional stimulations of the T 
cells with antigen are performed on approximately days 14 and 28. The T cells (3 x 
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lOVwell) are stimulated with peptide (10 fig/ml) using autologous PBMC cells (2 x 10* 
irradiated cells/well) (irradiated with 7500 rads) as antigen-presenting cells in a total of 
three wells of a 24 well tissue culture plate. In addition, on day 14 and 28, T cell 
proliferative responses are determined under the following conditions: 2 x 10"* T 
cells/well; 1 x 10^ irradiated PBMC/well as antigen-presenting cells; peptide 
concentration varying between 0.01 and 10 p,g/ral final concentration. The proliferation 
of the T cells is measured 3 days later by the addition of ^H-thymidine (1 nCi/weil) 18 hr 
prior to harvesting the cells. Cells are harvested onto glass filters and ^H-thymidine 
incorporation is measured in a beta plate counter. These results demonstrate methods for 
assaying T helper cell activity by measuring ^H-thymidine incorporation. 

EXAMPLE ni: Assav for Cytotoxic T Lvmphocvte Response 

This example shows a method for assaying cytotoxic T lymphocyte (CTL) 
activity. A CTL response is measured essentially as described previously (Vitiello et ai, 
Eur. J. Immunol. 27:671-678 (1997), which is incorporated herein by reference). Briefly, 
after approximately 10-35 days following DNA immunization, splenocytes from an 
animal are isolated and co-cultured at 37°C with syngeneic, irradiated (3000 rad) peptide- 
coated LPS blasts (1 x 10* to 1.5 x 10* cells/ml) in 10 ml RIO in T25 flasks. LPS blasts 
are obtained by activating splenocytes (1 x 10* to 1.5 x 10* cells/ml) with 25 jig/ml 
lipopolysaccharides (LPS) (Sigma cat. no. L-2387; St. Louis, MO) and 7 [ig/ml dextran 
sulfate (Pharmacia Biotech) in 30 ml RIO medium in T75 flasks for 3 days at 37^. The 
lymphoblasts are then resuspended at a concentration of 2.5 x 10^ to 3.0 x loVml, 
irradiated (3000 rad), and coated with the appropriate peptides (lOO^g/ml) for 1 h at 
37°C. Cells are washed once, resuspended in RIO medium at the desired concentration 
and added to the responder cell preparation. Cultures are assayed for cytolytic activity on 
day 7 in a ^'Cr-release assay. 

For the ^'Cr-release assay, target cells are labeled for 90 min at 37°C with 
150 ^1 sodium ^'chromate (^'Cr) (New England Nuclear; Wihnington DE), washed three 
times and resuspended at the appropriate concentration in RIO medium. For the assay, 
lO** target cells are incubated in the presence of different concentrations of effector cells 
m a final volume of 200 |il in U-bottom 96 well plates in the presence or absence of 1 0 
}ig/ml peptide. Supematants are removed after 6 h at 37°C, and the percent specific lysis 
is determined by the formula: percent specific lysis - iOO x (experimental release - 
spontaneous release), ^maximum release - spontaneous release). To facilitate comparison 
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of responses from different experiments, the percent release data is transformed to lytic 
units 30 per 10* cells (LU30/10*), with I LU30 defmed as the number of effector ceUs 
required to induce 30% lysis of 10* target cells in a 6 h assay. LU values represent the 
LU30/10* obtained in the presence of peptide minus LU30/10* in the absence of peptide. 
These results demonstrate methods for assaying CTL activity by measuring ^'Cr release 
from cells. 



EXAMPLE IV: T Cell Proliferation in Mice Immunized with Expression Vectors 
Encoding MHC Class U Epitopes and MHC Class 11 Targeting Sequences 

This example demonstrates that expression vectors encoding MHC cla^s II 
epitopes and IviHC class II targeting sequences are effective at activating T cells. 

The constructs used in the T cell proliferation assay are described in 
Example I and were cloned into the vector pEP2, a CMV driven expression vector. The 
peptides used for T cell in vitro stimulation are: Ova 323-339, ISQAVHAAHAEINEAGR 
(SEQ ID NO: 123); HBVcorel28, TPPAYRPPNAPILF (SEQ ID NO: 124); HBVenvl82, 
FFLLTRILTIPQSLO (SEQ ID NO: 125); and PADRE, AKFVAAWTLKAAA (SEQ ID 
NO:38). 

T cell proliferation was assayed essentially as described in Example II. 
Briefly, 12 to 16 week old B6D2 F 1 mice (2 mice per construct) were injected with 100 
^g of the indicated expression vector (50 ^ig per leg) in the anterior tibialis muscle. After 
eleven days, spleens were collected from the mice and separated into a single cell 
suspension by Dounce homogenization. The splenocytes were counted and one million 
splenocytes were plated per well in a 96-weIl plate. Each sample was done in triplicate. 
Ten |ig/ml of the corresponding peptide encoded by the respective expression vectors was 
added to each well. One well contained splenocytes without peptide added for a negative 
control. Cells were cultured at 37°C, 5% CO2 for three days. 

After three days, one ^Ci of ^H-thymidine was added to each well. After 
18 hours at 37°C, the cells were harvested onto glass filters and incorporation was 
measured on an LKB P plate counter. The results of the T cell proliferation assay are 
shown in Table 9. Antigenspecific T cell proliferation is presented as the stimulation 
index (SI); this is defined as the ratio of the average ^H-thymidine incorporation in the 
presence of antigen divided by the 'H-thymidine incorporation in the absence of antigen. 

The immunogen "PADRE + IFA" is a positive control where the PADRE 
peptide in incomplete Freund's adjuvant was injected into the mice and compared to the 
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response seen by injecting the MHC class II epitope constructs containing a PADRE 
sequence. As shown in Table 9, most of the expression vectors tested were effective at 
activating T cell proliferation in response to the addition of PADRE peptide. The activity 
of several of the expression vectors was comparable to that seen with immunization with 
the PADRE peptide in incomplete Freund's adjuvant. The expression vectors containing 
both MHC class I and MHC class II epitopes, pEP2-A0S and pcDNA-AOS, were also 
effective at activating T cell proliferation in response to the addition of PADRE peptide. 

These results show that expression vectors encoding MHC class n 
epitopes fused to a ^mc class H targeting sequence is effective at activating T cell 
proHferation and are useful for stimulating an immune response. 

EXAMPL E V: Tn vivo assav Using Transgenic Mice 
A. Maierials and methods 

Peptides were synthesized according to standard F-raoc solid phase 
synthesis methods which have been previously described (Ruppert et al. Cell 74:929 
(1993); Sette et al., Mot. Immunol. 31 :813 (1994)). Peptide purity was determined by 
analytical reverse-phase HPLC and purity was routinely >95%. Synthesis and 
purification of the Theradigm-HBV lipopeptide vaccine is described in (Vitiello et al. J. 
Clin. Invest. 95:341 (1995)). 



Mice 

HLA-A2.1 transgenic mice used in this study were the Fl generation 
derived by crossing transgenic mice expressing a chimeric gene consisting of the al, o2 
domains of HLA-A2.1 and a3 domain of H-2K'' with SJL/J mice (Jackson Laboratory, 
Bar Harbor, ME). This strain will be referred to hereafter as HLA-A2.1/k''-H-2''". The 
parental HLA-A2. I/K" transgenic strain was generated on a C57BL/6 background using 
the transgene and methods described m (Vitiello et al. J. Exp. Med. 173:1007 (1991)). 
HLA-Al transgenic mice used in the current study were identical to those described 
in (Alexander et al, J. Immunol 159:4753 (1997)). 

Cell lines. MHC n unfication. and peptide binding assav 
Target cells for peptide-specific cytotoxicity assays were Jurkat cells 
transfected with the HLA-A2.1/K'' chimeric gene (Vitiello et al. J. Exp. Med 173:1007 
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(1991)) and .221 tumor cells transfected with HLA-Al I/k" (Alexander et al. J. Immunol. 
159:4753 (1997)). 

To measure presentation of endogenously processed epitopes, Jurkat- 
A2.1/K'' cells were transfected with the pMin.l or pMin.2-GFP minigenes then tested in a 
cytotoxicity assay against epitope-specific CTL lines. For transfection, Jurkat-A2.1/K'' 
cells were resuspended at lO' cells/ml and 30 ^g of DNA was added to 600 ^il of cell 
suspension. After electroporating cells in a 0.4 cm cuvette at 0.25 kV, 960 fiFd, cells 
were incubated on ice for 10 min then cultured for 2d in RPMI culture medium. Cells 
were then cultured in medium containing 200 U/ml hygromycin B (Calbiochem, San 
Diego CA) to select for stable transfectants. FACS was used to enrich the fraction of 
green fluorescent protein (GFP)-expressing cells from 15% to 60% (data not shown) 
Methods for measuring the quantitative binding of peptides to purified 
HLA-A2.1 and -Al 1 molecules is described in Ruppert et al. Cell l^.'^l') (1993); Sette et 
al, Mol Immunol 31:813 (1994); Alexander al, J. Immunol 159:4753 (1997). 

All tumor cell lines and splenic CTLs from primed mice were grown in 
culture medium (CM) that consisted of RPMI 1640 medium with Hepes (Life 
'Technologies, Grand Island, NY) supplemented with 10% FBS, 4 mM L-glutamine, 5 X 
10-^M 2-ME, 0.5 mM sodium pyruvate, 100 ^g/ml streptomycin, and 100 U/ml 
penicilhn. 

Construction of min igene multi-epitope DNA plasmids 
pMIN.O and pMIN.l (i.e., pMIN-AOS) were constructed as described 
above and in USSN 60/085,75 1 . 



pMin.l-No PADRE and p Min.l -Anchor pMin.l was amplified using two 
overlapping fragments which was then combined to yield the fixll length product. The 
first reaction used the 5 ' pcDNA vector primer T7 and either primer 
ATCGCTAGGCAGGAACTTATACAGGATTCC (SEQ ID NO: 126) for pMin. 1-No 
PADRE or TGGACAGTCCGGCTCCCAGCACCACGT (SEQ ID NO: 127) for pMin. 1- 
Anchor. The 3' half was amplified with the primers TTCCTGCCTAGCGATTTC (SEQ 
ID NO: 128) (No PADRE) or GCTGGGAGCCGGACTGTCCAGGTACGT (SEQ ID 
NO:129) (Anchor) and Min-StopR. The two fragments generated from amplifying the 5' 
and 3' ends were gel purified, mixed, denatured, annealed, and fiUed in with five cycles 
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of PGR. The full length fragment was flinner amplified with the flanking primers T7 and 
Min-StopR for 25 more cycles. 



pMin.l-NoSig. The Ig signal sequence was deleted from pMin.l by PGR 
amplification with primer GGTAGGGCGGCGACCATGGAGAGCGTGTGGAAGGG 
CGGAATC (SEQ E) NO: 130) and pcDNA rev (Invitrogen) primers. The product was 
cloned into pCR-blunt and sequenced. 

pMin.l -Switch. Three overlapping fragments were amplified from 
pMin.l, combined, and extended. The 5' fragment was amplified with the vector primer 

JD NO: 13 1). The second overlapping fragment was amplified with primers 
AGCCTGCTGGTGCCCTTTGTGATCCTGAAGGAGCCTGTGC (SEQ ID NO: 132) 
and AGCCACGTACCTGGACAGTCCCTTCCACACAGGCACTCCAT (SEQ ID 
NO: 133). Primer TGTCCAGGTACGTGGCTAGGCTGTGAGGTACC (SEQ ID 
NO: 134) and the vector primer pcDNA rev (Invitrogen) were used to amplify the third 
(3') fragment. Fragments 1, 2, and 3 were amplified and gel purified. Fragments 2 and 3 
were mixed, annealed amplified, and gel purified. Fragment 1 was combined with the 
product of 2 and 3, and extended, gel purified and cloned into pcDNA3.1 for expression. 

pMin.2-GFP. The signal sequence was deleted from pMin.O by PGR 
amplification with Min.O-No Sig-5' plus pcDNA rev (Invitrogen) primers 
GCTAGCGCCGCCACCATGCACACCCTGTGGAAGGCCGGAATC (SEQ ID 
NO: 13 5). The product was cloned into pCR-blunt and sequenced. The insert containing 
the open reading frrnie of the signal sequence-deleted multi-epitope construct was cut out 
vnth Nhel plus HindUl and ligated into the same sites of pEGFPNl (Clontech). This 
construct fuses the coding region of the signal-deleted pMin.O construct to the N-tenninus 
of green fluorescent protein (GFP). 



Immunization of mice 

For DNA immunization, mice were pretreated by injecting 50 ^1 of 10 pM 
cardiotoxin (Sigma Chem. Co., #C9759) bilaterally into the tibialis anterior muscle. Four 
or five days later, 100 \ig of DNA diluted in PBS were injected in the same muscle. 



53 - 



wo 99/58658 



PCT/US99/10646 



Theradigm-HBV lipopqjtide (10 mg/ml in DMSO) that was stored at - 
20°C, was thawed for 10 min at 45°C before being diluted 1:10 (v/v) with room 
temperature PBS. Immediately upon addition of PBS, the lipopeptide suspension was 
vortexed vigorously and 100 \il was injected s.c. at the tail base (100 fig/mouse). 

Immunogenicity of individual CTL epitopes was tested by mixing each 
CTL epitope (50 ng/mouse) with the HBV core 128-140 peptide (TPPAYRPPNAPIL 
(SEQ ID NO: 124), 140 ng/mouse) which served to induce I-A''-restricted Th cells. The 
peptide cocktail was then emusUfed in incomplete Freund's adjuvant (Sigma Chem. Co.) 
and 100 ^1 of peptide emulsion was injected s.c. at the tail base. 

In vuro CTI. cultures and cytotoxicity assavs 

Eleven to 14 days after immunization, animals were sacrificed and a single 
cell suspension of splenocytes prepared. Splenocytes from cDNA-primed animals were 
stimulated in vitro with each of the peptide epitopes represented in the minigene. 
Splenocytes (2.5-3.0 X lOVflask) were cultured in upright 25 cm^ flasks in the presence 
of 1 0 ng/ml peptide and 1 0' irradiated spleen cells that had been activated for 3 days with 
LPS (25 |ig/ml) and dextran sulfate (7 ng/ml). Triplicate cultures were stimulated with 
each epitope. Five days later, cultures were fed with fresh CM. After 10 d of i« vitro 
culture, 2-4 X 10* CTLs from each flask were restimulated with 10^ LPS/dextran sulfate- 
activated splenocytes treated with 100 jig/ml peptide for 60-75 min at 37°C, then 
irradiated 3500 rads. CTLs were restimulated in 6-well plates in 8 ml of cytokine-free 
CM. Eighteen hr later, cultures received cytokines contained in con A-activated 
splenocyte supernatant (10-15% final concentration, v/v) and were fed or expanded on the 
third day with CM containing 10-15% cytokine supemate. Five days after restimulation, 
CTL activity of each culture was measured by incubating varying numbers of CTLs with 
lO'' ^'Cr-labelled target cells in the presence or absence of peptide. To decrease 
nonspecific cytotoxicity from NK cells, YAC-1 cells (ATCC) were also added at a YAC- 
1 :*'Cr-labeled target cell ratio of 20:1. CTL activity against the HBV Pol 551 epitope 
was measured by stimulating DNA-primed splenocytes in vitro with the native A- 
containing peptide and testing for cytotoxic activity against the same peptide. 

To more readily compare responses, the standard E:T ratio vs % 
cytotoxicity data cur^'es were converted into LU per 10^ effector cells with one LU 
defined as the lytic activity required to achieve 30% lysis of target cells at a 100:1 E:T 
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ratio. Specific CTL activity (ALU) was calculited by subtracting the LU value obtained 
in the absence of peptide from the LU value obtained with peptide. A given culture was 
scored positive for CTL induction if all of the following criteria were met: 1) ALU >2; 2) 
LU(+ peptide) -r LU(- peptide) > 3; and 3) a >10% difference in % cytotoxicity tested 
with and without peptide at the two highest E:T ratios (starting E:T ratios were routinely 
between 25-50:1). 

CTL lines were generated from pMin. 1 -primed splenocytes through 
repeated weekly stimulations of CTLs with peptide-treated LPS/DxS -activated 
splenocytes using the 6-well culture conditions described above with the exception that 
CTLs were expanded in cytokine-containing CM as necessary during tiie seven day 
stimulation period. 

Cytokine assay 

To measure EFN-y production in response to minigene-transfected target 
cells, 4 X 10"* CTLs were cultured with an equivalent number of minigene-transfected 
Jurkat-Al.l/K" cells in 96-well flat bottom plates. After overnight incubation at 37°C, 
culture supernatant from each well was collected and assayed for IFN-y concenti-ation 
using a sandwich ELIS A. Immulon 11 microtiter wells (Dynatech, Boston, MA) were 
coated overnight at 4°C with 0.2 ^g of anti-mouse IFN-y capture Ab, R4-6A2 
(Pharmingen). After washing wells with PBS/0.1% Tween-20 and blocking with 1% 
BSA, Ab-coated wells were incubated with culture supemate samples for 2 hr at room 
temperature. A secondary anti-IFN-y Ab, XMG1.2 (Pharmingen), was added to wells and 
allowed to incubate for 2 hr at room temperatiu-e. Wells were then developed by 
incubations with Avidin-DH and fmally witii biotinylated horseradish peroxidase H 
(Vectastain ABC kit, Vector Labs, BurUngame, CA) and TMB peroxidase substi-ate 
(Kirkegaard and Peny Labs, Gaithersberg, MD). The amount of cytokine present in each 
sample was calculated using a rIFN-y standard (Pharmingen). 

b. Results 

Selection of epi topes and minigene construct design 
In the first series of experiments, the issue was whether a balanced 
multispecific CTL response could be induced by simple minigene cDNA constiucts tiiat 
encode several dominant HLA class 1-restricted epitopes. Accordingly, nine CTL 
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epitopes were chosen on the basis of their relevance in CTL immimity during HBV and 
HIV infection in humans, their sequence conservancy among viral subtypes, and their 
class I MHC binding affinity (Table 10). Of these nine epitopes, six are restricted by 
HLA-A2.1 and three showed HLA-Al 1 -restriction. One epitope, HBV Pol 551, was 
studied in two alternative fonns: either the wild type sequence or an analog (HBV Pol 
551-V) engineered for higher binding affinity. 

As referenced in Table 10, several independent laboratories have reported 
that these epitopes are part of the dominant CTL response during HBV or HIV infection. 
All of the epitopes considered showed greater than 75% conservancy in primary amino 
acid sequence among the different HBV subtypes and HIV clades. The MHC binding 
a^xjuty of the peptides was also considered in selection of the epitopes. These 
experiment addressed the feasibility of immunizing with epitopes possessing a wide range 
of affinities and. as shown in Table 10, the six HBV and three HIV HLA-restricted 
epitopes covered a spectrum of MHC binding affmities spanning over two orders of 
magnitude, with IC5o% concentrations ranging from 3 nM to 200 nM. 

The immunogenicity of the six A2.1- and three Al 1 -restricted CTL 
epitopes in transgenic mice was verified by co-immunization with a helper T cell peptide 
m an IFA formulation. All of the epitopes induced significant CTL responses in the 5 to 
73 ALU range (Table 10). As mentioned above, to improve the MHC binding and 
immunogenicity of HBV Pol 551, the C-terminal A residue of this epitope was substituted 
with V resulting in a dramatic 40-fold increase in binding affinity to HLA-A2.1 (Table 
1 0). While the parental sequence was weakly or nonimmunogenic in HLA transgenic 
mice, the HBV Pol 551-V analog induced significant levels of CTL activity when 
administered in IFA (Table 10). On the basis of these results, the V analog of the HBV 
Pol 551 epitope was selected for the initial minigene construct. In all of the experiments 
reported herein, CTL responses were measured with target cells coated with the native 
HBV Pol 551 epitope, irrespective of whether the V analog or native epitope was utilized 
for immunization. 

Finally, since previous studies indicated that induction of T cell help 
significantly improved the magnitude and duration of CTL responses (VitieUo et al, J. 
Clin. Invest. 95:341 (1995); Livingston etal. J. Immunol. 159:1383 (1997)), the universal 
Th cell epitope PADRE was also incorporated into the minigene. PADRE has been 
shown previously to have high MHC binding affinity to a wide range of mouse and 
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human MHC class II haplotypes (Alexander et al., Immunity 1 :75 1 (1994)). In particular, 
it has been previously shown that PADRE is highly immunogenic in H-2'' mice that are 
used in the current study (Alexander et al.. Immunity 1:751 (1994)). 

pMin. 1 , the prototype cDNA minigene construct encoding nine CTL 
epitopes and PADRE, was synthesized and subcloned into the pcDNA3.1 vector. The 
position of each of the nine epitopes in the minigene was optimized to avoid junctional 
mouse H-2'' and HLA-A2.1 class I MHC epitopes. The mouse Ig k signal sequence was 
also included at the 5' end of the construct to facilitate processing of the CTL epitopes in 
the endoplasmic reticulum (ER) as reported by others (Anderson et al.. J. Exp. Med. 
174:489 (1991)). To avoid further conformational structure in the translated polypeptide 
gene product that may affect processmg of the CTL epitopes, an ATG stop codon was 
introduced at the 3' end of the minigene construct upstream of the coding region for c- 
myc and poly-his epitopes in the pcDNA3.1 vector. 



Immunogenici ty of pMin.l in transgenic mice 

To assess the capacity of the pMin.l minigene construct to induce CTLs in 
vivo, HLA-A2.1/K''-H-2'"" transgenic mice were immunized intramuscularly with 100 |ig 
of naked cDNA. As a means of comparing the level of CTLs induced by cDNA 
immunization, a control group of animals was also immunized with Theradigm-HBV, a 
palmitolyated lipopeptide consisting of the HBV Core 18 CTL epitope linked to the 
tetanus toxin 830-843 Th cell epitope. 

Splenocytes from immunized animals were stimulated twice with each of 
the peptide epitopes encoded in the minigene, then assayed for peptide-specific cytotoxic 
activity in a ^'Cr release assay. A representative panel of CTL responses of pMin. 1- 
primed splenocytes, shown in Figure 22, clearly indicates that significant levels of CTL 
induction were generated by minigene immunization. The majority of the cultures 
stimulated with the different epitopes exceeded 50% specific lysis of target cells at an E:T 
ratio of 1:1. The results of four independent experiments, compiled in Table 11, indicate 
that the pMin.l construct is indeed highly immunogenic in HLA-A2.1/K''-H-2'"" 
transgenic mice, inducing a broad CTL response directed against each of its six A2.1- 
restricted epitopes. 

To more conveniently compare levels of CTL iaduction among the 
different epitopes, the % cytotoxicity values for each splenocyte culture was converted to 
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ALU and the mean ALU of CTL activity in positive cultures for each epitope was 
determined (see Example V, materials and methods, for positive criteria). The data, 
expressed in this manner in Table 11, confirms the breadth of CTL induction elicited by 
pMin.l immunization since extremely high CTL responses, ranging between 50 to 700 
5 ALU, were observed against the six A2.1 -restricted epitopes. More significantly, the 
responses of several hundred ALU observed for five of the six epitopes approached or 
exceeded that of the Theradigm-HBV lipopeptide, a vaccine formulation known for its 
high CTL-inducing potency (Vitiello et al. J. Clin. Invest. 95:341 (1995); Livingston et 
al, J. Immunol. 159:1353 (1997)). The HBV Env 335 epitope was the only epitope 
1 0 showing a lower mean ALU response compared to lipopeptide (Table 1 1, 44 vs 349 
ALU). 

Processing of minigene epitopes bv transfected cells 

The decreased CTL response observed against HBV Env 335 was 

1 5 somewhat unexpected since this epitope had good A2. 1 binding affinity (IC50%, 5 nM) 
and was also immunogenic when administered in IF A. The lower response may be due, 
at least in part, to the inefficient processing of this epitope firom the minigene polypeptide 
by antigen presenting cells following in vivo cDNA immunization. To address this 
possibility, Jurkat-A2.1.X'' tumor cells were transfected with pMin.l cDNA and the 

20 presentation of the HB\' Env 335 epitope by transfected cells was compared to more 
immunogenic A2.1 -restricted epitopes using specific CTL lines. Epitope presentation 
was also studied using mmor cells transfected with a control cDNA construct, pMin.2- 
GFP, that encoded a similar multi-epitope minigene fused with GFP which allows 
detection of miiugene expression in transfected cells by FACS. 

25 Epitope presentation of the transfected Jurkat cells was analyzed using 

specific CTL lines, with cytotoxicity or IFN-y production serving as a read-out. It was 
found that the levels of CTL response correlated directly with the in vivo immunogenicity 
of the epitopes. Highl>' immunogenic epitopes in vivo, such as HBV Core 18, HIV Pol 
476, and HBV Pol 455, were efficiently presented to CTL lines by pMin.l- or pMin.2- 

30 GFP-transfected cells as measured by IFN-y production (Figure 23 A, >1 00 pg/ml for each 
epitope) or cytotoxic activity (Figure 23 C, >30% specific lysis). In contrast to these high 
levels of m vitro activin-, the stimulation of the HBV Env 335-specific CTL line against 
both populations of transfected cells resulted in less than 12 pg/ml IFN-y and 3% specific 
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lysis. Although the HBV Env 335-specific CTL' line did not recognize the naturally 
processed epitope efficiently, this line did show an equivalent response to peptide-loaded 
target cells, as compared to CTL lines specific for the other epitopes (Figure 23B, D). 
Collectively, these results suggest that a processing and/pr presentation defect associated 
5 with the HBV Env 335 epitope that may contribute to its diminished immunogencity in 
vivo. 

Effect of th e helper T cell epitope PADRE on minigene immunoeenicitv 
Having obtained a broad and balanced CTL response in transgenic mice 
1.0 immunized with a minigene cDNA encoding multiple HLA-A2. 1 -restricted epitopes, next 
possible variables were examined that could influence the immunogenicity of the 
prototype construct. This type of analysis could lead to rational and rapid optimization of 
future constructs. More specifically, a cDNA construct based on the pMin.l prototype 
was synthesized in which the PADRE epitope was deleted to examine the contribution of 
1 5 T cell help in minigene immunogenicity (Figure 24A). 

The results of the immunogenicity analysis indicated that deletion of the 
PADRE Th cell epitope resulted in significant decreases in the frequency of specific CTL 
precursors against four of the minigene epitopes (HBV Core 1 8, HIV Env 120, HBV Pol 
455, and HBV Env 335) as indicated by the 17 to 50% CTL-positive cultures observed 
20 against these epitopes compared to the 90-1 00% frequency in animals immunized with 
the prototype pMin. 1 construct (Figure 25). Moreover, for two of the epitopes. HBV 
Core 18 and HTV Env 120, the magnitude of response in positive cultures induced by 
pMin.l-No PADRE was 20- to 30-fold less than that of the pMin.l construct (Figure 
25A). 

25 

Effect of modulation of MHC binding affinity on epitope immimogenicitv 
Next a construct was synthesized in which the V anchor residue in HBV 
Pol 551 was replaced with alanine, the native residue, to address the effect of decreasing 
MHC binding on epitope immunogenicity (Figure 24B). 
30 Unlike deletion of the Th cell epitope, decreasing the MHC binding 

capacity of the HBV Pol 551 epitope by 40-foId through modification of the anchor 
residue did not appear to affect epitope immunogenicity (Figure 25B). The CTL response 
against the HBV Pol 55 1 epitope, as well as to the other epitopes, measured either by LU 
or firequency of CTL-positive cultures, was very similar between the constructs 
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containing the native A or improved V residue at the MHC binding anchor site. This 
finding reinforces the notion that minimal epitope minigenes can efficiently deliver 
epitopes of vastly different MHC binding afiOnities. Furthermore, this finding is 
particularly relevant to enhancing epitope immunogenicity via different delivery methods, 
especially in light of the fact that the wild type HBV Pol 55 1 epitope was essentially 
nonimmunogenic when delivered in a less potent IFA emulsion. 

Effect of the sig nal sequence on minieene construct immimogenicity 
The signal sequence was deleted firom the pMin.l construct, thereby 
preventing processing of the minigene polypeptide in the ER (Figure 24C). When the 
immunogenicity of the pMin.i-No Sig construct was examined, an overall decrease in 
response was found against four CTL epitopes. Two of these epitopes, fflV Env 120 and 
HBV Env 335, showed a decrease in frequency of CTL-positive cultures compared to 
pMin.l while the remaining epitopes, HBV Pol 455 and HIV Pol 476, showed a 16-fold 
(fi-om 424 to 27 ALU) and 3-fold decrease (709 to 236 ALU) in magnitude of the mean 
CTL response, respectively (Figure 25C). These findings suggest that allowing ER- 
processing of some of the epitopes encoded in the pMin.l prototype construct may 
improve immunogenicity, as compared with constructs that allow only cytoplasmic 
processing of the same panel of epitopes. 

Effect of epitope rearrangement and creation of new junctional epitopes 
In the final construct tested, the immunogenicity of the HBV Env 335 
epitope was analyzed to determine whether it may be influenced by its position at the 3' 
terminus of the minigene construct (Figure 24D). Thus, the position of the Env epitope in 
the cDNA construct was switched with a more immunogenic epitope, HBV Pol 455, 
located in the center of the minigene. It should be noted that this modification also 
created two potentially new epitopes. As shown in Figure 25D, the transposition of the 
two epitopes appeared to affect the immunogenicity of not only the transposed epitopes 
but also more globally of other epitopes. Switching epitopes resulted in obliteration of 
CTL induction against HBV Env 335 (no positive cultures detected out of six). The CTL 
response induced by the terminal HBV Pol 455 epitope. was also decreased but only 
slightly (424 vs 78 mean ALU). In addition to the switched epitopes, CTL induction 
against other epitopes Ln the pMin.l-Switch construct was also markedly reduced 
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compared to the prototvpe construct. For example, a CTL response was not observed 
against the HTV Env 120 epitope and it was significantly diminished against the HBV 
Core 1 8 (4 of 6 positive cultures, decrease in mean ALU from 306 to 52) and HBV Pol 
476 (decrease in mean ALU from 709 to 20) epitopes (Figure 25D). 

As previously mentioned, it should be noted that switching the two 
epitopes had created new junctional epitopes. Indeed, in the pMin.l-Switch construct, 
two new potential CTL epitopes were created from sequences of HBV Env 335-HIV Pol 
476 (LLVPFVIL (SEQ ID NO: 135), H-2K*'-restricted) and HBV Env 335-HBV Pol 551 
(VLGVWLSLLV (SEQ ED NO:136), HLA-A2.1 -restricted) epitopes. Although these 
junctional epitopes have not been examined to determine whether or not they are indeed 
immunogenic, this may account for the low immunogenicity of the HBV Env 335 and 
HTV Pol 476 epitopes. These findings suggest tiiat avoiding junctional epitopes may be 
important in designing multi-epitope mmigenes as is the ability to confirm their 
immunogenicity in vivo in a biological assay system such as HLA transgenic mice. 

Induction of CTLs against Al 1 epitopes encoded in pMin.l 
To further examine the flexibility of the minigene vaccine approach for 
inducing a broad CTL response against not only multiple epitopes but also against 
epitopes restricted by different HLA alleles, HLA-Al l/K** transgenic mice were 
immunized to determine whether the three Al 1 epitopes in the pMin.l construct were 
immunogenic for CTLs, as was the case for the A2.1 -restricted epitopes in the same 
construct. As summarized in Table 12, significant CTL induction was observed in a 
majority of cultures against all three of the HLA-A 11 -restricted epitopes and the level of 
CTL immunity induced for the three epitopes, in the range of 40 to 260 ALU, exceeded 
that of peptides delivered in IFA (Table 10). Thus, nine CTL epitopes of varying HLA 
restrictions incorporated into a prototype minigene construct all demonstrated significant 
CTL induction in vivo, confirming that minigene DNA plasmids can serve as means of 
delivering multiple epitopes, of varying HLA restrictions and MHC binding affmities, to 
the immune system in an immunogenic fashion and that appropriate transgenic mouse 
strains can be used to measure DNA construct immunogenicity in vivo. 

CTLs were also induced against three Al 1 epitopes in Al l/K*" transgenic 
mice. These responses suggest that minigene delivery of multiple CTL epitopes that 
confers broad population coverage may be possible in humans and that transgenic animals 
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of appropriate haplotypes may be a useful tools in optimizing the in vivo immunogenicity 
of minigene DNA. In addition, animals such as monkeys having conserved HLA 
molecules with cross reactivity to CTL and HTL epitopes recognized by human MHC 
molecules can be used to determine human immunogenicity of HTL and CTL epitopes 
(Bertoni et al, J. /mwwno/. 16 1:4447-4455 (1998)). 

This study represents the fu^t description of the use of HLA transgenic 
mice to quantitate the in vivo immunogenicity of DNA vaccines, by examining response 
to epitopes restricted by human HLA antigens. In vivo studies are required to address the 
variables crucial for vaccine development, that are not easily evaluated by in vitro assays, , 
such as route of administration, vaccine formulation, tissue biodistribution, and 
involvement of primary and secondary lymphoid organs. Because of its simplicity and 
flexibility, HLA transgenic mice represent an attractive alternative, at least for initial 
vaccine development studies, compared to more cumbersome and expensive studies in 
higher animal species, such as nonhuman primates. The in vitro presentation studies 
described aboye further supports the use of HLA transgenic mice for screening DNA 
constructs containing human epitopes inasmuch as a direct correlation between in vivo 
immunogenicity and in vitro presentation was observed. Finally, strong CTL responses 
were observed against all six A 2.1 restricted viral epitopes and in three Al 1 restricted 
epitopes encoded in the prototype pMin.l construct. For five of the A 2.1 restricted 
epitopes, the magnitude of CTL response approximated that observed with the 
lipopeptide, Theradigm-HBV, that previously was shown to induce strong CTL responses 
in humans (Vitiello el al. J. Clin. Invest 95:341 (1995); Livingston et al. J. Immunol. 
159:1383 (1997)). 
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Table 9. Activation of T Cell Proliferation by Expression 
Vectors Encoding MHC Class II Epitopes Fused to MHC 
Class II Targeting Sequences 

5 

Imraunogen Stimulating Peptide' 

PADRE OVA 323 CORE 128 



peptide - CFA^ 


3.0(1.1) 


2.7(1.2) 


3.2 (1.4) 


pEP2.(PA0S).(-) 








pEP2.(A0S).(-) 


5.6(1.8) 






pEP2.(PAOS).(sigTh) 


5.0 (2.9) 




2.6(1.5) 


pEP2.(PA0S).(IgaTh) 


5.6 (2.1) 




3.0(1.6) 


pEP2.(PA0S).(LampTh) 


3.8 (1.7) 




3 


pEP2.(PA0S).(IiTh) 


5.2 (2.0) 


3.2(1.5) 


3.7(1.5) 


pEP2.(PA0S).(H2M) 


3.3 (1.3) 




2.8 



'Geometric mean of cultures with SI > 2. 
^Proliferative response measured in the lymph node. 
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Table 10 
CTL Epitopes in cDNA Minigene 

Immunogenicity In Vivo (IFA) 



MHC No. CTL- CTL Response 
MHO Binding Positive (Geo. Mean 
Restrict. AfTmity Cultures x/h-SD) 









[IC30%(nM) 




ALU 


HBVCore 18 


FLPSDFFPSV 


A2.1 


3 


6/6 


73.0(1.1) 


HBVEnv335 


WLSLLVPFV 


A2.1 


5 


4/6 


5.3(1.6) 


HBV Pol 455 


GLSRYVARL 


A2.1 


76 


ND' 


ND 


HIVEnv 120 


KLTPLCVTL 


A2.1 


102 


2/5 


6.4(1.3) 


HIV Pol 476 


ILKEPVHGV 
YMCDV'.'LGA 
YMDDWLGV 


A2.1 


192 


2/5 


15.2(2.9) 


HBV Pol 551-V 


A2.1 


5 


6/6 


8.2 (2.3) 


HIVEnv 49 


TVYYGVPVWK 


All 




28/33 


13.4 (3.1) 


HBV Core 141 


STLPETTWRR 


All 




6/6 


12.1 (2.6) 


HBV Pol 149 


HTLWKAGILYK 


All 


14 


6/6 


13.1 (1.2) 



a Peptide tested in HLA-A2. 1 /K*" H-2 transgenic mice by co-immunizing with a T helper cell peptide in IF A. 
b Geometric mean CTL response of positive cultures, 
c ND, not done. 
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Table 11 

Summary of Immimogenicity of pMin.l DNA 
construct in HLA A2.1/K'' transgenic mice 



CTL Response " 

Epitope No. Positive Geo. Mean Response Positive 

CulturesA'otal'' Cultures [x/-=-SD] 

ALU 

HBVCorelS 9/9 455.5 [2.2] 

fflVEnv 120 ' 12/12 211.9 [3.7] 

HBVPol551-V 9/9 126.1 [2.8] 

HBVP0I455 12/ 12 738.6 [1.3] 

HrVPol476 11 / 11 716.7 [1.5] 

HBVEnv335 12/ 12 43.7 [1.8] 

HBVCore IS 10/ 10 349.3 [1.8] 
(Theradigm)' 

Mice were immunized with pMin.l DNA or Theradigm-HBV lipopeptide and CTL 
activity in spIenoc\'te cultures was determined after in vitro stimulation with 
individual peptide epitopes. Results from four independent experiments are shown. 

See Example V, Materials and Methods for definition of a CTL-positive culture. 

Response of mice immunized with Theradigm-HBV lipopeptide containing the HBV 
Core 1 8 epitope. 
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Table 12 
Summary of immunogenicity 
in HLA Al l/K** transgenic mice 



CTL Response' 



Epitope 


No. Positive 
Cultures/Total'' 


Geo. Mean Response 
Positive Cultures [x/^ SD] 






ALU 


HBV Core 141 


5/9 


128.1 [1.6] 


HBV Pol 149 


6/9 


267.1 [2.2] 


HIV Env 43 


9/9 


40.1 [2.91 



^ Mice were immunized with pMin. 1 DNA and CTL activity in splenocyte cultures was 
determined after in vitro stimulation with individual Al 1 -restricted epitopes. The 
geometric mean CTL response from three independent experiments are shown. 

*" Definition of a CTL-positive culture is described in Example V, Materials and 
Methods. 
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WHAT IS CLAIMED IS: 

1 1 . An expression vector comprising a promoter operably linked to a 

2 first nucleotide sequence encoding a major histocompatibility (MHC) targeting sequence 

3 fused to a second nucleotide sequence encoding two or more heterologous peptide 

4 epitopes, wherein the heterologous peptide epitopes comprise two HTL peptide epitopes 

5 or a CTL peptide epitope and a universal HTL peptide epitope. 

1 2. The expression vector of claim 1, wherein the heterologotis peptide 

2 epitopes comprise two or more heterologous HTL peptide epitopes. 

1 3. The expression vector of claim 1, wherein the heterologous peptide 

2 epitopes comprise a CTL peptide epitope and a universal HTL peptide epitope. 

1 4. The expression vector of claim 2, wherein the heterologous peptide 

2 epitopes further comprise one or more CTL peptide epitopes. 

1 5. The expression vector of claim 3, wherein the heterologous peptide 

2 epitopes further comprise two or more CTL peptide epitopes. 

1 6. The expression vector of claim 3, wherein the heterologous peptide 

2 epitopes further comprise two or more HTL peptide epitopes. 

1 7. The expression vector of claim 2, wherein one of the HTL peptide 

2 epitopes is a universal HTL epitope. 

1 8. The expression vector of claim 3 or 7, wherein the universal HTL 

2 epitope is a pan DR epitope. 

1 9. The expression vector of claim 8, wherein the pan DR epitope has 

2 the sequence AlaLysPheValAlaAIaTrpThrLeuLysAIaAlaAla (SEQ ID NO:38). 

1 10. The expression vector of claim 1, wherein the peptide epitopes are 

2 hepatitis B virus epitopes, hepatitis C virus epitopes, human immimodeficiency virus 

3 epitopes, human papilloma virus epitopes, MAGE epitopes, PSA epitopes, PSM epitopes, 

4 PAP epitopes, p53 epitopes, CEA epitopes, Her2/neu epitopes, or Plasmodium epitopes. 
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1 11. The expression vector of claim 1 0, wherein the peptide epitopes 

2 each have a sequence selected from the group consisting of the peptides depicted in 

3 Tables 1-8. 

1 12. The expression vector of claim 1 1 , wherein at least one of the 

2 peptide epitopes is an analog of a peptide depicted in Tables 1-8. 

1 13. The expression vector of claim 1 , wherein the MHC targeting 

2 sequence comprises a region of a polypeptide selected from the group consisting of the li 

3 protein, LAMP-I, HLS-DM, HLA-DO, H2-D0, influenza matrix protein, hepatitis B 

4 surface antigen, hepatitis B virus core antigen, Ty particle, Ig-a protein, lg-|i protein, and 

5 Ig kappa chain signal sequence. 

1 14. The expression vector of claim 1 , wherein the expression vector 

2 fluther comprises a second promoter sequence operably linked to a third nucleotide 

3 sequence encoding one or more heterologous HTL or CTL peptide epitopes. 

1 15. The expression vector of claim 1 , wherein the vector comprises 

2 pMinl or pEP2. 

1 16. The expression vector of claim 3 or 4, wherein the CTL peptide 

2 epitope comprises a structural motif for an HLA supertype, whereby the peptide CTL 

3 epitope binds to two or more members of the supertype with an affinity of greater that 

4 500 nM. 

1 17. The expression vector of claim 4 or 5 , wherein the CTL peptide 

2 epitopes have structural motifs that provide binding affinity for more than one HLA allele 

3 supertype. 

1 18. A method of inducing an immune response iM VIVO comprising 

2 administering to a mammahan subject an expression vector comprising a promoter 

3 operably linked to a first nucleotide sequence encoding a major histocompatibility (MHC) 

4 targeting sequence fused to a second nucleotide sequence encoding two or more 

5 heterologous peptide epitopes, wherein the heterologous peptide epitopes comprise two 

6 HTL peptide epitopes or a CTL peptide epitope and a tmiversal HTL peptide epitope. 
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19. The method of claun 1 8, wherein the heterologous peptide epitopes 

1 comprise two or more heterologous HTL peptide epitopes. 

[ 20. The method of claim 1 8, wherein the heterologous peptide epitopes 

> comprise a CTL peptide epitope and a universal HTL peptide epitope. 

[ 21. The method of claim 1 9, wherein the heterologous peptide epitopes 

I further comprise one or more CTL peptide epitopes. 

1 22. The method ofclaim 20, wherein the heterologous peptide epitopes 

2 further comprise two or more CTL peptide epitopes. 

1 23 . The method of claim 20, wherein the heterologous peptide epitopes 

2 further comprise two or more HTL peptide epitopes. 

1 24. The method of claim 19, wherein the HTL peptide epitope is a 

2 universal HTL epitope. 

1 25 . The method of claim 20 or 24, wherein the universal HTL epitope 

2 is a pan DR epitope. 

1 26. The method ofclaim 25, wherein the pan DR epitope has the 

2 sequence AlaLysPheValAiaAlaTrpThrLeuLysAlaAlaAla (SEQ ID NO:38). 

1 27. The method of claim 1 8, wherein the peptide epitopes are hepatitis 

2 B virus epitopes, hepatitis C virus epitopes, human immunodeficiency virus epitopes, 

3 human papilloma virus epitopes, MAGE epitopes, PSA epitopes, PAP epitopes, PSM 

4 epitopes, p53 epitopes, CEA epitopes, Her2/neu epitopes, or Plasmodium epitopes. 

1 28. The method ofclaim 27, wherein the peptide epitopes each have a 

2 sequence selected from the group consisting of the peptides depicted in Tables 1-8. 

1 29. The method ofclaim 28, wherein least one ofthe peptide epitopes 

2 is an analog of a pepride depicted in Tables 1-8. 

1 30. The method ofclaim 1 8, wherein the MHC targeting sequence 

2 comprises a region of a poliTeptide selected from the group consisting of the li protein, 

3 LAMP-I, HLS-DM, HLA-DO, H2-D0, influenza matrix protein, hepatitis B surface 
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4 antigen, hepatitis B virus core antigen, Ty particle, Ig-a protein, Ig-P protein, and Ig 

5 kappa chain signal sequence. 

1 31. The method of claim 1 8, wherein the expression vector further 

2 comprises a second promoter sequence operably linked to a third nucleotide sequence 

3 encoding one or more heterologous HTL or CTL peptide epitopes. 

1 32. The method of claim 1 8, wherein the vector comprises pMin. 1 or 

2 pEP2. 

1 33. The method ofclaim 20 or 21, wherein the CTL peptide epitope 

2 comprises a structural motif for an HLA supertype, whereby the peptiuC epitope uinus to 

3 two or more members of the supertype with an affinity of greater that 500 nM. 

1 34. The method of claim 21 or 22, wherein the CTL peptide epitopes 

2 have structural motifs that provide binding affinity for more than one HLA allele 

3 supertype. 

1 3 5 . A method of inducing an immune response in vivo comprising 

2 administering to a mammalian subject an expression vector comprising a promoter 

3 operably linked to a first nucleotide sequence encoding a major histocompatibility (MHC) 

4 targeting sequence fused to a second nucleotide sequence encoding a heterologous human 

5 HTL peptide epitope. 

1 36. The method ofclaim 35, wherein the second nucleotide sequence 

2 further comprises two or more heterologous HTL peptide epitopes. 

1 37. The method of claim 35, wherein the second nucleotide sequence 

2 further comprises one or more heterologous CTL peptide epitopes. 

1 38. The method ofclaim 35, wherein the HTL peptide epitope is a 

2 universal HTL peptide epitope 

1 39. The method ofclaim 38, wherein the imiversal HTL epitope is a 

2 pan DR epitope. 

1 40. The method ofclaim 39, wherein the pan DR epitope has the 

2 sequence AlaLysPheValAlaAlaTrpThrLeuLysAlaAlaAla (SEQ ID NO:38). 
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1 41. The method of claim 37, wherein the HTL and CTL peptide 

2 epitopes are hepatitis B virus epitopes, hepatitis C virus epitopes, himian 

3 immunodeficiency virus epitopes, human papilloma virus epitopes, MAGE epitopes, PSA 

4 epitopes, PAP epitopes, PSM epitopes, p53 epitopes, CEA epitopes, Her2/neu epitopes, 

5 or Plasmodium epitopes. 

1 42 . The method of claim 4 1 , wherein the peptide epitopes each have a 

2 sequence selected firom the group consisting of the peptides depicted in Tables 1-8. 

1 43 . The method of claim 42, wherein at least one of the peptide 

2 epitopes is an analog of a peptide depicted in Tables 1 -8 . 

1 44. The method of claim 35, wherein the MHC targeting sequence 

2 comprises a region of a polypeptide selected from the group consisting of the li protein, 

3 LAMP-I, HLS-DM, HLA-DO, H2-D0, influenza matrix protein, hepatitis B surface 

4 antigen, hepatitis B virus core antigen, Ty particle, Ig-a protein, Ig-P protein, and Ig 

5 kappa chain signal sequence. 

1 45. The method ofclaim 35, wherein the expression vector further 

2 comprises a second promoter sequence operably linked to a third nucleotide sequence 

3 encoding one or more heterologous HTL or CTL peptide epitopes. 

1 46. The method of claim 37, wherein the CTL peptide epitope 

2 comprises a structural motif for an HLA supertype, whereby the peptide epitope binds to 

3 two or more members of the supertype with an affinity of greater that 500 nM. 

1 47. The method ofclaim 37, wherein the CTL peptide epitopes have 

2 structural motifs that provide binding affmity for more than one HLA allele supertype. 

1 48. A method of assaying the human immimogenicity of a human T 

2 cell peptide epitope in vivo in a non-hiunan mammal, comprising the step of 

3 administering to the non-human mammal an expression vector comprising a promoter 

4 operably linked to a first nucleotide sequence encoding a heterologous human CTL or 

5 HTL peptide epitope. 
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1 49. The method of claim 48, wherein the first nucleotide sequence 

2 encodes two or more heterologous CTL or HTL peptide epitopes. 

1 50 . The method of claim 48, wherein the non-human mammal is a 

2 transgenic mouse that expresses a human HLA allele. 

1 51. The method of claim 50, wherein the human HLA allele is selected 

2 from the group consisting of Al 1 and A2. 1 . 

1 52. The method ofclaim 48, wherein the expression vector further 

2 comprise a second nucleotide sequence encoding a major histocompatiblity (MHC) 

3 targeting sequence. 

1 53. The method ofclaim 48, wherein the HTL peptide epitope is a 

2 universal HTL epitope. 

1 54. The method ofclaim 53, wherein the universal HTL epitope is a 

2 pan DR epitope. 

1 55. The method of claim 54, wherein the pan DR epitope has the 

2 sequence AlaLysPheValAlaAlaTrpThrLeuLysAlaAlaAla (SEQ ID NO:38). 

1 56. The method of claim 48, wherein the CTL or HTL peptide epitopes 

2 are hepatitis B virus epitopes, hepatitis C virus epitopes, human immunodeficiency virus 

3 epitopes, human papilloma virus epitopes, MAGE epitopes, PSA epitopes, PSM epitopes, 

4 PAP epitopes, p53 epitopes, CEA epitopes, Her2/neu epitopes, or Plasmodium epitopes. 

1 57. The method of claim 56, wherein the CTL or HTL peptide epitopes 

2 each have a sequence selected from the group consisting of the peptides depicted in 

3 .Tables 1-8. 

1 58. The method of claim 57, wherein at least one of the peptide 

2 epitopes is an analog of a peptide depicted in Tables 1-8. 

1 59. The method ofclaim 52, wherein the MHC targeting sequence 

2 comprises a region of a polypeptide selected from the group consisting of the li protein, 
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3 LAMP-I, HLS-DM, HLA-DO, H2-D0, influenza, hepatitis B virus core antigen, Ty 

4 particle, Ig-a protein, Ig-P protein, and Ig kappa chain signal sequence. 

1 60. Themethodofclaiin48, wherein the expression vector further 

2 comprises a second promoter sequence operably linked to a third nucleotide sequence 

3 encoding one or more heterologous human CTL or HTL peptide epitopes. 

1 61. The method of claim 48, wherein the vector comprises pMin. 1 or 

2 pEP2. 

1 62. The method of claim 48, wherein the CTL peptide epitope has a 

2 structural motif that provides binding affinity for an HLA allele supertype. 

1 63 . The method of claim 49, wherein the CTL peptide epitopes have 

2 structural motifs that provide binding affinity for more than one HLA allele supertype. 

1 64. The method ofclaim 48, wherein the expression vector comprises 

2 both HTL and CTL peptide epitopes. 
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10 20 30 40 50 60 70 



GCTaiGCGCCaCCACCATGGATGACCAACGCGACCrCATCTCTAACCRTGAGCAATTGCCCATACT^^ 
CX3ATCGCGGCGGTGGTACCTACrGGTTGaOT«5AGTAGAGATTGGTACTCGTTAA 

MD3QRDLISNHEQI.PILG> 

80 90 100 110 120 130 140 



ACCGCCCTACSAGAGCCAGAAAGGTGOIGCCGTCWAGCTCTGTACACCGGTC^^ 

tggcgggatctctcggt c tt i ccacgtcggcacctcgagacatgtggccacaaagacaggaccaccgaga 
nrpr2PSR.csrgalytgvsvl'val> 

150 160 170 180 190 200 210 



GCTCTTGGCTGGGCAGGCClACCACrrGCTTACTTCrTGTACCAGCAACAGGGCC^ 

CGAGAACCGACCCGTCCGGTGGrGACGAATGAAGGACATGGTCGTTGTCCCGGCGGATCTGTTCGACTGG 
Z,LAGQATTAYFLVQQQGRLDKI.T> 

3?0 230 240 2SQ 260 270 280 



ATCACCrCCCAGAACCTGCaACrrGGAGAGCCrrrCGCATGAAGCrTCCGAAATCTGCCAAACCTGTGGCCA 
TAGTGGAGGGTCTTGGACGTTGACCrrCTCGGAAGaSTACTTCGAAGGCTTTAGACGGTTTGGACACCOT 
ITSQNLQLHSLRMKLPKS AKPVA> 

290 300 310 320 330 340 350 



AGTTCGTGGCTGCCTGGACCCTGAAGGCTGCCGCTATGTCCATGGATAACATGCTCCTTGGGCCTGTGAA 
TCAAGCACCXIACGGACCTGGGACTTCCGACXXJCGATACAGGTACCTATTGTACGAGGAACCCGGACACTT 
KFVAAWTLKAAAMSMDNMLLGPVK* 

360 370 380 390 400 410 420 



GAACGTTACCAAGTACGGCAACATGACCCAGGACCATGTGATGCATCTGCTCACGAGGTCTGGACCCCTG 
CTTGCAATGGTTCATGCCGTTGTACTGGGTCCTGGTACACTACGTAGACGAGTGCTCCAGACCTCGGGAC 
KVTKYGNMTQDHVMHLLTRSGEL> 

430 440 450 460 470 480 490 



GAGTACCCGCAGCTGAAGGGGACCTTCCCAGAGAATCTGAAGCATCTTAAGAACTCCATGGATGGCGTGA 
CTCATGCGCGTCGACTTCCCCrGGAAGGGTCTCrrAGAen-CGTAGAAT-rCTrcAGGTACCTACCGCACT 
EYPQ.LKGTFPEKLKHLKNS MDGV> 

500 510 520 530 540 S50 560 



ACT33GAAGATCTTCGAGAGCTGGATGAAGCAGTGGCTCTTGTTTGAGATGAGCAAGAACTCCCTTC 
TGKCCTTCTAGAAGCrCTCGACCTACTTCGTCACCGAGAACAAACTCTACrCGTTCTTGAGGGACCTCCT 
NWKIFESWMKQWLLFEMS KKSLEE> 

570 580 590 600 610 620 630 



GAAGAAGCCCACCGAGGCTCCACCTAAAGAGCCACTGGACATGGAAGACCTATCTTCTGGCCTGGGAGTG 
CTTCTTCGGGTGGCTCCGAGGTGGATTTCTCGGTGACCTCTACCrXCTGGATAGAAGACCGGACCC^^ 
KKPTEAP?KEPLDMEDLSSGLGV> 

640 650 660 



ACCAGGC3U3GAACTGGGTCAAGTCACCCTGTGAGGTACC 
TGGTCCGTCCTTGACCCAGTTCAGTGGGACACTCCATGG 
TRQELGQVTL«> 

FICORE 1 
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GCTAGCGCCGCCACCATGGATGACCAACGCGACCTCATCTCTAACCATGAGCAATTGCCCATACTGGGCA 
CGATCGCGGCGGTGGTACCTACTGGTTGCGCTGGAGTAGAGATTGGTACTCGTTAACGGGTATGACCCGT 
MDDQRDLISNKEQLPILG> 



ACCGCCCTAGAGAGCCAGAAAGGTGCAGCCGTGGAGCTCTGTACACCGGTGTTTCTGTCCrcGTGGCT 

TGGCGGGATCTCrCGGTCTTTCCACGTCGGCACCTCGAGACATGTGGCCACAAAGACAGGACCACCGAGA 

NRPREPERCSRGALYTGVSVLVAL> 



GCTCTTGGCTGGGCAGGCCACCACTGCTTACTTCCTGTACCAGCAACAGGGCCGCCrAGAC^ 
CGAGAACCGACCCGTCCGGTGGTGACGAATGAAGGACATGGTCGTTGTCCCGGCGGATCTGTTCGACTGG 
LLAGQATTAYFLYQQQGRLDKLT> 



ATCACCTCCCAGAACCTGCAACTGGAGAGCCTTCGCATGAAGCTTATCAGCCAGGCTGTGCACGCCGCTC 
TAGTGGAGGGTCTTGGAC3TTGACCTCTCGGAAGCGTACTTCGAATAGTCGGTCCGACACGTGCGGCGAG 
TTSQNLQLESLRMKLISQAVHAA> 



ACGCCGAAATCAACGAAGCTGGAAGAACCGCTCCAGCTTATCGCCCTCCAAACGCTCCTATCCTGTTCTT 
TGCGGCTTTAGTTGCTTCGACCTTCTTGGGGAGGTCGAATAGCGGGAGGTTTGCGAGGATAGGACAAGAA 
KAEINEAGRTPPAYRPPNAPILFF> 



TCTGCTGACCAGAATCCTGACAATCCCCCAGTCCCTGGACGCCAAGTTCGTGGCTGCCTGGACCCTGAAG 
AGACGACTGGTCTTAGGACTGTTAGGGGGTCa-GGGACCTGCGGTTCAAGCACCGACGGACCTGGGACTTC 
LLTRILTIPQSLDAKFVAAWTLK> 



GCTGCCGCTTGAGGTACC 
CGACGGCGAACTCCATGG 
A A A *> 
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GCTAGCGCCGCCACaVTGGATGACCAACGCGACCra^TCTCTAACCA.TGAGCAATTGCCCATACT 
CXaVTCGCGGCGGTGGTACCTACTGGTTGCGCTGGAGTflimGATTCGTAC^ 

MDDQRDLISNHEQLPILG> 



ACCGCCCTAGAGAGCCAGAAAGGTGCAGCCGTGGAGCTCTGTACACCGGTGTTTCTGTCCTGGTGGCTCT 
TGGCGGGATCTCTCGGTCTTTCCACGTCGGCACCTCGAGACATGTGGCCACAAAGACAGGACCACCGA^ 
NRPR E PERCSRGALYTGV S VL.VAL> 



GCTCrrGGCTGGGCAGGCai.CCACrrGCrrACTTCCTGTACCAGCARCaGGGCCGCCTAGACAAGC^^ 
CGAGAACCGACCCGTCCGGT^TGACGAATGAAGGACATGGTCGTTGTCCCGGCGGATCTGTTCGACTGG 
LLAGQATTAYFLYQQQGRLDKLT> 



ATCACCTCCCAGAACCrcCAACI^AGAGCCrTCGCATGAAGCTTATCAGCCAGGCTGTG 
TAGTGGAGGGTCTTGGACGTTGACCrrCTCGGAAGCGTACTTCGAATAGTCGGTCCGACACGTGCGGCGAG. 
ITSQNLQLESLRMKLISQAVHAA> 



HAEINEAGRTPPAYRPPNAPILFF> 
360 370 380 390 400 410 420 

tctgctgaccagaatcctgacaa7cccccagtccctggacgccaagttcgtggctgcctggaccctgaag 
agacgactggtctraggactgttagggggtcagggacctgcggttcargcaccgacggacctgggacttc 
lltrilt:pqsldakfvaawtlk> 



GCTGCCGCTATGTCCATGGATAACATGCTCCTTGGGCCTGTGAAGAACGTTACCAAGTACGGCAACATGA 

cgacggcgatacaggtacctattgtacgaggaacccggacacttcttgcaatggttcatgccgttgtact 

AAAM.S MDNMLLG PVKNVT K YGNM> 

500 SIO S20 S30 540 550 560 

CCCAGGACCATGTGATGCATCTGCTCACGAGGTCTGGACCCCTGGAGTACCCGCAGCTGAAGGGGACCTT 
GGGTCCTGGTACACrACGTAGACGAGTGCTCCAGACCTGGGGACCTCATGGGCGTCGACrrCCCCTGGAA 
TQDHVMHLLTRSGPLEYPQLKGTF> 



CCCAGAGAATCTGAAGCATCTTAAGAACTCCATGGATGGCGTGAACTGGAAGATCTTCGAGAGCTGG 
GGGTrrCTTAGACTTCGTAGAArrCTTGAGGTACCTACCGCAarrGACCTTCTAGAAGCT^ 

penlkhl?:ksmdgvnwkifeswm> 
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640 650 660 670 680 €90 700 

AAGCAGTGGCTCnrrGTTTGAGATGAGCAAGAACTCCCTGGAGGAGAAGAAGCCCACCGAGGCT^ 
TTCGTCACCGAGAACAAACrCTACTCGTTCTTGAGGGACCTCCTCrrCTTCGGGTGGCT^ 
KQWLLr2MSKNSLEEKK PTEAPP> 

710 720 730 • 740 750 760 770 

AAGAGCCACTGGAOVTGGAAGACCTATCTTCTGGCCTGGGAGTGACCAGGCAGGAACTGGGTCAAGTC^ 
TTCTCGGTGACCTGTACCTTCTGGATAGAAGACCGGACCCTCACTGGTCCGTCCTTGACCCAGTTCAGTG 
KEPLDMEDLSSGLGVTRQELGQVT> 

780 

CCTGTGAGGTACC 
GGACACTCCATGG 
L *> 



FIGURE 3 CONTINUED 
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GCTAGCGCCGCCACCAT«3GAATGCACK3TGCRGATCCAGAGCCTGTTTCTGCTCCTCCra^ 
CGATCGCGGCGGTGGTACCCTTACGTCCACGTCTAGGTCTCGGACAAAGACGAGGAGGACACCCACGGGC 
MGMQVQIQSLFLLLLWVP> 

80 90 100 110 120 130 ^ 140 

GGTCCAGAGGAATCAGCCAGGCTGTGCACGCCGCTCACGCCGAAATCAACGAAGCTGGAAGAACCCCTCC 
CCAGGTCTCCTTAGTCGGTCCGACACGTGCGGCGAGTGCGGCTTTAGTTGCrrTCGACCTTCTTGGGGAG^ 
GSRGISQAVHAAHAEINEAGRTPP> 



AGCTTATCGCCCTCCAAACGCTCCTATCCTGTTCTTTCTGCrGACCAGAATCCTGACAATCCCCCAGTCC 
TCGAATAGCGGGAGGTTTGCGAGGATAGGACAAGAAAGACGACTGGTCTTAGGACTGTTAGGGGGTCAGG 
, ^ ^ r T •rp.Tt,TIPQS> 



CTGGACGCCAAGTrCGTGGCTGCCTGGACCCrGAAGGCTGCCGCTAACAACATGTTGATCCCCATTGCTG 
GACCTGCGGTTCAAGCACCGACGGACCTGGGACTTCCGACGGCGATTGTTGTACAACTAGGGGTAACGAC 
LDAKFVAAWTLKAAANNMLIP rA> 



TGGGCGGTGCCCTGGCAGGGCTGGTCCTCATCGTCCTCATTGCCTACCTCATTGGCAGGAAGAGGAGTCA 
ACCCGCCACGGGACCGTCCCGACCAGGAGTAGCAGGAGTAACGGATGGAGTAACCGTCCTTCTCCTGAGT 
VGGALAGLVLIVLIAYLIGRKRSH> 



CGCCGGCTATCAGACCATCTAGGGTACC 
GCGGCCGATAGTCTGGTAGATCCCATGG 
A G Y Q T I *> 



FIGURE 4 
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GCTAGCGCCGCCACCATGGCTGCACrCKKSCTGCTGCTGCTGGTCCTCAGTCTG^ 
CGATCGCGGCGGTGGTACCGACGTGAGACCGACGACGACGACCAGGAGTCAGACGTGACATACCCCTAGT 
MAALWLLLLVLS LHCMGI> 



GCCAGGCrGTGCACGCCGCrCACGCCGAAATOUVCGAAGCTGGAAGAACCCCnrCCAGCrTATCGCCCrCC 
a3GTCCGACACGTGCGGCGAGTGCGGCTTrAGTTGCTTCGACCITCTTGG<K;AGGTCGAAT^^ 
SQAVEAAHAEIMEAGRT P PAyRPP> 



AAACGCTCCTATCCTGTTCTTTCTGCTGACCAGAATCCTGACAATCCCCCAGTCCCTGGACGCCAAGT^ 
TTTGCGAGGATAGGACAAGAAAGACGACTGGTCTTAGGACTGTTAGGGGGTCAGGGACCTGCGGTTCAAG 



GTGGCTGCCTGGACCCTGAAGGCTGCCGCTAAGGTCTCrGTGTCTGCAGCCyvCCCTGGGCCrGGacrTCA 
CACCGACGGACCTGGGACTTCCGACGGCGATTCCAGAGACACAGACGTCGGTGGGACCCGGACCCGAAGT 
VAAWTLKAAAKVSVSAATLGLGF> 



TCATCTTCTGTGTTGGCTTCTTCAGATGGCGCAAGTCTCATTCCTCCAGCTACACTCCTCTCCCTGGATC 
AGTAGAAGACACAACCGAAGAAGTCTACCGCGTTCAGAGTAAGGAGGTCGATGTGAGGAGAGGGACCTAG 
IIF CVGFFRWRKSHSSSYTPLPGS> 



CACCTACCCAGAAGGACGGCATTAGGGTACC 
GTGGATGGGTCTTCCTGCCGTAATCCCATGG 
TYPEGRH*> 
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MGAGRAPWVVALLVNLMR> 



TGGATTCCATCAGCCAGGCTGTGCACGCCGCTCACGCCGAAATOU^CGAAGCTGGAAGAACCCCrrCC^ 
ISSSJIS^GOTCCGACACGTGCGGCGAGTGCGGCTTTAGTXGCTTC^^^ 

LDSISQAVHAAHAEINEAGRTPPA> 
ISO 160 170 ^ 180 ^ 130 ^ 200 ^ 210 

ttatcgccctccaaIcgctcctatcctgt^ctttctgctgaccagaatcctgacaatcccccm^ 

SlGSSS??^GCGAGGATAGaACAAGAAAGACGACTGGTCTTAGGACT^^^ 

220 230 240 ^ 250 ^ 2S0 ^ 270 ^ 280 

GACG^CAAGCTCGT^CrC-CCTGGlcCCT^AAGGCTGCC^TATACTGAGTGGAGCTGCAGTG^^ 

c^^S^SIgcaccgacggacctgggacttccgacggcgatatgactcacctcg^^^^ 

DAKFVAAWTLKAAAILS GAAVti.> 
iOO 310 320 330 340 350 



290 ■>0C 

TTGG^CTGA^TGTOTCCr^TGG^GTTGTTATCCATCTCAAGGCTCAGAAAGCATCTGTG^^ 
SSGlSS^AGAAGGACCACCCCCAACAATAGGTAGAGTrCCGAGTCT^^^ 

LGLIVFLVGVVIHLKAQKASVETU 



GCCTGGCAATGAGAGTAGGTCCCGGATGATGGAGCGGCTAACCAAGTTCAAGG^^^^ 
CGGACCGTTACTCTCATCCAGGGCCTACTACCTCGCCGATTGGTTCAAGTTCCGACCTGGCCCTGTACAG 

P G N E 



RSRMMERLTKFKAGPGEV> 



430 



ACATGAGGTACC 
TGTACTCCATGG 



FIGURE 6 
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50 60 
iCCTGCACCCTGAAGGCTGCCGCTATGAGTCTTCTAA 



130 



210 



■"^"^ TCCCTTCT7GTGTCTAGAACTCCGAGAGTACCTTACCGATTTCTOTCT 



TGAGGAl 



D V F A G 



L GFVFTLT 

10 340 3S0 



310 320 
ALNGNGDP 



360 370 390 400 410 ''^O 

LYKKLKRHMTFHGAKE 

GALA, SCKGLIYNRMGrVT-rt 

500 SIC 520 530 540 ^ 5S0 ^ 560 

rGCTCATGCCCAACATCGGTCCCACAGGCAGATGGCGACTAC 



GCCTAGTATGTGCCACTTGTGA3CAGATTGCI 

— " '^\t7rAt*ijAjUl ivjx**v3^ 

D A Q H I 

.4. SSO «0 <70 _ «« . 



SSE QAAEAMEVASQARQMVO 
FIGURE 7 
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CAATGAGGACAATTGGGACTCACCCTWXrrcaunrGCRGCTCTA^ 

AMRTIGTHPSSSAGLKDDLIENLQ> 



(MCTrACCAGAAACGGATGGGGCrrGCAGATGCJWK;(»TTCAAGTGA 
CCGAATCCTCTTTGCCTACCCCCAOSTCTACXITCGCTAAGTrCACT 
AYQKRMGVQMQRFK*> 



FIGnUE 7 CONTIUtrED 
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SSS?SS^ScGGCTCTTGTAGTCTAGTCCTAAa3ATCCTG<^CA^^ 

GPCLNAENITSGFLGPLLVLQAGF> 

150 160 170 180 ^ 190 ^ 200 ^ 210 

SlSSIS^AGGAGTGTTATGGCGTCXCAGATCTGAGCACCACa^^^ 



«5GG^AACTACCGTGTGTCTTGGCCyU\AArrCGCAGTCCCCRACCTCCAATCACTCACC^^ 
SS^SJSS^CAGAACCOGTTTTAAGCGTCAGGGGTXGGAGGTTAGT^^^^ 
GGTTVCI-GQKSQSPTSNHSPTSO 

290 300 310 320 ^ 330 ^ 340 ^ 350 

CXCcLc.T;rCCX;G™CGC^AXG;GX™ 

360 • 370 ■ 3a0 ^ 390 ^ 400 ^ 410 ^ 420 

ATGC^CAT^CT^CTTGGTTCTicTGGlcTAT^GGkTGTTGCCCGTTTGTCCTCTA^^^ 

tIcSSISIgaacaaccaagaagacctgatagttccatacaacgooca^^ 



gaggttgaacaggaccaatagcgacctacacagacgccgcaaaatagtagaaggagaagtagga^^ 
pptcpgyrwmclrrfi 



430 440 450 4S0 ^ ^ 

SSTTSTGPCRTCMTTAQGTSMlf 

SCCCTKPSDGMCTCIPIPSSW 



^;tcctItgcoagtgggcc:tcagccx:gtttctcctggctca^ 

TTTTAAGGATACCCTCACCCGGAOTCGGGCAAAGAGGACCGRGTCAAATGATC^^ 

kflwewasarfswlsllvp 



10/42 



640 



PCTAJS99/I0646 
680 S90 700 



TTCGTAGGGCTTTCCCCCACTGTTTGGCTTTCAGTTATATGGATGATGTGGTAT^^ 
AAGCATCCCC^AAGGGGGTCaCAAACCGAAAGTCV^TATACCTACTACACCATAACCCCCGG^ 

G LSPTVWLSVIWMMWYWGPSL> 

710 720 730 740 750 ^ 760 ^ 770 

AraG^TOTGAGTCCCTTTTTACCGCrGTTACCAATTTTCTTTTGTC^ 

TCTCGTAGAACTCAGGGAAAAATGGCGACAATWrnrAAAAGAAAACAGAAACCCATATOT^ 

vslLSPFLPLI.PIFFCr.WVYI*> 



AACAAAACAAAGAGATGGGGTTACTCTCTAA 
TTGTTTTGTTTCTCTACCCCAATGAGAGATT 



FIGURE 8 CONTINUED 
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HPGGLEALRALPLLI.FI-s> 



80 



90 



100 



110 



120 



130 



140 



YACLGPGCQAlSQAVHAAHAEINt^ 



ISO 



ISO 



170 



180 



190 



200 



210 



XGRTPPAYRPPMAPIL* 



L TI PQSLDAKFVAAWTLKAAAOi^ 



300 



310 



TCTGCTGTICTCTCCACTT^TGC^AG^ 

IISSSISISS?SSLg.cc™cga^^^ 

ILLFCAVVPGTLLLFRKRwy 



360 



370 



380 



TGGa;rGGAi.TGcL^AT;ACTA;=AAGITGAAL^ 
ACCCCACCTOTACGGTCTACTCATACrrCTACrrrTTAGAGATACTCCa^ n D C S> 

GVDKPDDYEDENLVEGLNUU 



430 



HQ 



470 



490 



A:rcrlTGAG;ACAT;TCCA;cXX=A;TCCACCGCACCTAC^ 
TACATACTCC^AGAGGTCCCCrGAGGTCCCGTGGATGGTCCTACACCCGTT^ 
MY EDIsRGLQGTYQDVGNLHi 

SOO 510 

CCCAGCTGGAAAAGCCATCAGGTACC 
GGGTCGACCTTTTCCGTACTCCATCG 
A Q L. E K P •> 



FIGURE 9 
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KATLVLSSMPC HWUL.t U U> 
80 90 100 ^ 110 ^ 120 ^ 130 ^ 140 

150 160 170 180 ^ 190 ^ 200 ^ 210 



J^TCCCCTGCCACTGGC; 



TTOSGGAGCrrCGAXTAGCGGGAGGTTTGCGAGGATJ 



230 240 ^ 250 ^ 2o« ^ ^ 



<XX:Aa3AGTAGTACX5AGAAGTAGTAACACGC5GTAGAAGGACC^T^3?J^CTC^ K A G> 

T:- L L X- X L F^- r IV P I^. F L L L D K u 

360 370 " ^ 380 ^ 390 ^ 400 ^ 410 ^ 420 

GATC^AGGAlcATcIcACC^ATt^A^^AAciTTCA^^^ 
■ CTACCTCCTTCTAGTGrraiATACTCCCGAACTrGTAACTGGTCT^^ I V T> 

KEEDHTYEGLNIDQTATVEDX 



CrrCGGACAGGGGAGGTAAAGTGCrrcGGrAGGAGAGCATCCAGGC^^ 
GAAGCCrtrrCCCCTCCATTICACCAGCCATCCTCTCGTAGGTCCGC^ 
LRTGEVKWSVGEHPCQE > 



FIGURE 10 
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GCTRCXXKXXa^CACCATKKSAATGCJi^XTIGCAGATCCAGAG^^^ I'CC XCC'H. C iU-iX;GCrGCCCG 

CGATCGCGCXXaSTCGrACCCTTACCICCAOTIXr^^ 

MGMQVQIQSLFLLLLWVP> 



CCrrCC OSAGGAA TCATCCACGCTCnCCACGCasCT^^ 

CCAGGgoiX»-xrAGTCGSTCa5ACJU:CTGCC^^ 

GSRGISQAVHAAHAEINE 



AGCTTATCGCCCTCCAAACGCTCCTATCCTSTT^^ 
TCaAATAGCGGGAGGTTTGCCAJMATAGGACAAGAAAGACGACTGGT^ 

AYRPPNAPrLFFLLTRILTrPQS> 



CTGGACGCCAAGTTCCrrGGCTGCCTG^ 

GACCTGCtKrrrtlAAGCACCGACGGACCrGGGACrKrCGACt^ 



FIGURE H 
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TTCCCAG ATG CAC AGG AGG AGA AGC AGG AGC TGT CGG GAA GAT CAG AAG 
Met His Arg Arg Arg Ser Arg Ser Cys Arg Glu Asp Gin Lys 
1 S 10 

CCA GTC ATG GAT GAC CAG CGC GAC CTT ATC TCC AAC AAT Ga6 CAA CTG 
III Met ASP ASP Gin Arg Asp Leu He Ser Asn Asn Glu Gin Leu 
IS 20 25 

CCC ATG CTG CGC CGG CGC CCT GGG GCC CCG GAG AGC AAG TGC AGO CGC 
P^o mI^ Leu Gly Arg Arg Pro Gly Ala Pro Glu Ser Lys Cys ser Arg 
35 *0 

GGA GCC CTG TAC ACA GGC TTT TCC ATC CTG GTG. ACT CTG CTC CTC GCT 
Ty Leu Tyr Thr Gly Phe Ser He Leu Val Thr Leu Leu Leu Ala 
SO S5 SO 

r^-^rt r'Utz aac CGG CTG 

neir CAG GCC ACC ACC GCU TAi. l ic uiu a*\v. v-nv^ 

S Sa Thr Thr Ala Tyr Phe Leu Tyr Gin Gin Gin Gly Arg Leu 
SS 70 75 

GAC AAA CTG ACA GTC ACC TCC CAG AAC CTG CAG CTG GAG AAC CTG CGC 
Su Jhr val Thr Ser Gin Asn Leu Gin Leu Glu Asn Leu Arg 
" 80 as ^° 

ATG AAG CTT CCC AAG CCT CCC AAG CCT GTG AGC AAG ATG CGC ATG GCC 
^s Le. pro Lys Pro Pro Lys Pro Val Ser Lys Met Arg Met Ala 
95 100 105 

ACC CCG CTG CTG ATG CAG GCG CTG CCC ATG GGA GCC CTG CCC CAG GGG 
?S vTo Leu Leu Met Gin Ala Leu Pro Met Gly Ala Leu Pro Gin Gly 
115 120 12S 

CCC ATG CAG AAT GCC ACC AAG TAT GGC AAC ATG ACA GAG GAC CAT GTG 
pro Met Gin Asn Ala Thr Lys Tyr Gly Asn Met Thr Glu Asp H.s Val 



,130 



ATG CAC CTG CTC CAG AAT GCT GAC CCC CTG AAG GTG TAC CCG CCA CTG 
M^t Ss Zu Leu Gin Asn Ala Asp Pro Leu Lys Val Tyr Pro Pro Leu 

145 150 
AAG GGG AGC TTC CCG GAG AAC CTG AGA CAC CTT AAG AAC ACC ATG GAG 
Lys Gly ser Phe Pro Glu Asn Leu Arg His Leu Lys Asn Thr Met Glu 
165 170 



ISO 



ACC ATA GAC TGG AAG GTC TTT GAG AGC TGG ATG CAC CAT TGG CTC CTG 
?S ui ASP Trp Lys val Phe Glu Ser Trp Met His His Trp Leu Leu 
175 



FIGURE 12 
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TTT GAA ATG AGC AGG C^C TCC TTG GAG CAA AAG CCC ACT GAC GCT CCA 
Phe Glu Met Ser Arg His Ser Leu Giu Gla Lys Pro Thr Asp Ala Pro 



195 



CCG AAA GAG TCA CTG GAA CTG GAG GAC CCG TCT TCT GGG CTG GGT GTG 
Pro oil ser Leu Gl. Lau Glu Asp Pro Ser Ser Gly Leu Gly Val 

210 21S 
ACC AAG CAG GAT CTG GGC CCA GTC CCC ATG TGAGAGCAGC AGAGGCGGTC 
Tlir Lys Gin Aso Leu Gly Pro Val Pro Met 
225 230 



FIGURE 12 continued 
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CCGCCTCGGC ATG GCG CCC CGC AGC GCC CGG CGA CCC C^G CTG CTG CTA 
Met Ala Pro Arg Ser Ala Arg Arg Pro Leu Leu Leu Leu 



CTG CCT GTT GCT GCT GCT CGG CCT CAT GCA TTG TCG TCA GCA GCC ATG 
Leu Pro Val Ala Ala Ala Arg Pro His Ala Leu Ser Ser Ala Ala Met 
IS 20 2S 

TTT ATG GTG AAA AAT GGC AAC GGG ACC GCG TGC ATA ATG GCC AAC TTC 
Phe Met val Lys Asa Gly Asa Gly Thr Ala Cys lie Met Ala Asn Phe 
30 35 40 « 

TCT GCT GCC TTC TCA GTG AAC TAC GAC ACC AAG AGT GGC CCC AAG AAC 
Ser Ala Ala Phe Ser Val Asn Tyr Asp Thr Lys Ser Gly Pro Lys Asn 
50 SS 

ATG ACC TTT GAC CTG CCA TCA GAT GCC ACA GTG GTG CTC AAC CGC AGC 
Met Thr Phe Asd Leu Pro Ser Asp Ala Thr Val Val Leu Asn Arg ser 
65 70 7S 

TCC TGT GGA AAA GAG AAC ACT TCT GAC CCC AGT CTC GTG ATT GCT TTT 
Se- Cys Gly Lys Glu Asn Thr Ser Asp Pro Ser Leu Val He Ala Phe 
80 85 90 

GGA AGA GGA CAT ACA CTC ACT CTC AAT TTC ACG AGA AAT GCA ACA CGT 
Gly Arg Gly His Thr Leu Thr Leu Asn Phe Thr Arg Asn Ala Thr Arg 
95 100 105 

TAC AGC GTT CAG CTC ATG AGT TTT GTT TAT AAC TTG TCA GAC ACA CAC 



CTT TTC CCC AAT GCG AGC TCC AAA GAA ATC AAG ACT GTG GAA TCT ATA 
Leu Phe Pro Asn Ala Ser Ser Lys Glu lie Lys Thr Val Glu Ser He 
130 135 140 

ACT GAC ATC AGG GCA GAT ATA GAT AAA AAA TAC AGA TGT GTT AGT GGC 
Thr Asp He Arg Ala Asp He Asp Lys Lys Tyr Arg Cys Val Ser Gly 
145 ISO 155 

ACC CAG GTC CAC ATG AAC AAC GTG ACC GTA ACG CTC CAT GAT GCC ACC 
Thr Gin Val His Met Asn Asn Val Thr Val Thr Leu His Asp Ala Thr 
160 165 170 

ATC CAG GCG TAC CTT TCC AAC AGC AGC TTC AGC AGG GGA GAG ACA CGC 
He Gin Ala Tyr Leu Ser Asn Ser Ser Phe Ser Arg Gly Glu Thr Arg 
175 180 185 



FIGOBZ 13 
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TGT GAA CAA GAC AGG CCT TCC CCA ACC ACA GCG CCC CCT GCG CCA CCC 
CV3 Glu Gin Aso Arg Pro Ser Pro Thr Thr Ala Pro Pro Ala Pro Pro 
190 " 19S 200 205 

AGC CCC TCG CCC TCA CCC GTG CCC AAG AGC CCC TCT GTG GAC AAG TAC 
Se- o-o Se- P^o Ser Pro Val Pro Lys Ser Pro Ser Val Asp Lys Tyr 
210 215 220 

AAC GTG AGC GGC ACC AAC GGG ACC TGC CTG CTG GCC AGC ATG GGG CTG 
Asn Val Ser Gly Thr Asn Gly Thr Cys Leu Leu Ala Ser Met Gly Leu 
225 230 235 

CAG AAC CTC ACC TAT GAS AGG AAG GAC AAC ACG ACG GTG ACA AGG 

Gin lIu Asn Leu Thr Tyr Glu Arg Lys Asp Asn Thr Thr Val" Thr Arg 
240 245 2S0 

erf CTC AAC ATC AAC CCC AAC AAG ACC TCG GCC AGC GGG AGC TGC GGC 



Leu Leu Asn lie Asn fro Asn uys i 



260 2SS 

GCC CAC CTG GTG ACT CTG GAG CTG CAC AGC GAG GGC ACC ACC GTC CTG 
Ala --iis Leu Val Thr Leu Glu Leu His Ser Glu Gly Thr Thr Val Leu 
270 275 280 285 

CTC TTC CAG TTC GGG ATG AAT GCA AGT TCT AGC CGG TTT TTC CTA CAA 
Leu Phe Gin Phe Gly Met Asn Ala Ser Ser Ser Arg Phe Phe Leu Gin 
290 295 300 

GGA ATC CAG TTG AAT ACA ATT CTT CCT GAC GCC AGA GAC CCT GCC TTT 
Gly He Gin Leu Asr. Thr He Leu Pro Asp Ala Arg Asp Pro Ala Phe 
305 310 

AAA GCT GCC AAC GGC TCC CTG CGA GCG CTG CAG GCC ACA GTC GGC AAT 
Lys Ala Ala Asn Gly Ser Leu Arg Ala Leu Gin Ala Thr Val Gly Asn 
320 325 330 

TCC TAC AAG TGC AAC GCG GAG GAG CAC GTC CGT GTC ACG AAG GCG TTT 
Ser T-zr Lys' Cys Asn Ala Glu Glu His Val Arg Val Thr Lys Ala Phe 
335 340 345 

TCA GTC AAT ATA TTC AAA GTG TGG GTC CAG GCT TTC AAG GTG GAA GGT 
Se- Val Asn lie Phe Lvs Val Trp Val Gin Ala Phe Lys Val Glu Gly 
350 353 3S0 365 

GGC CAG TTT GGC TCT GTG GAG GAG TGT CTG CTG GAC GAG AAC AGC ACG 
Glv Gin Phe Gly Ser Val Glu Glu Cys Leu Leu Asp Glu Asn Ser Thr 
370 375 330 



FIGURE 13.C0NTIN0ED 



18/42 



wo 99/58658 



PCT/US99/10646 



CTG ATC CCC ATC GCT GTG GOT GGT GCC CTG GCG GGG CTG GTC CTC ATC 
Leu He Pro He Ala Val Gly Gly Ala Leu Ala Gly Leu Val Leu He 
38S 390 395 

GTC CTC ATC GCC TAC CTC GTC GGC AGG AAG AGG AGT e^C GCA GGC TAG 
Val Leu He Ala Tyr Leu Val Gly Arg Lys Arg Ser His Ala Gly Tyr 

400 405 
CAG ACT ATC TAGCCTGGTG CACGCAGGCA CAGCAGCTGC AGGGGCCTCT 
Gin Thr He 
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80 SO ^ 1°; "° . ": . 

180 190 200 210 



ISO 160 170 

KOLLTCWDPEENKMAP^*- 



230 240 250 260 270 ^ 280 



^^^^,,^P^r— CACAGCACCrCAACCAAAAAGACACCCTGATGCAGCGCTTGCGCAATGGGC 

330 340 3S0 



290 300 310 320 

LQNCATHTQPFWGSLTN 

QVAKTTPFNTREPVMLACX 



"0 ^ 530 ^ S40 ^ S50 ^ S.O 
.aAC;GCCclGCCcL.XGG;~CA;ACC^^^ 
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GGAcIcrrACACCrrGTGTG<rrAGAGCACATT«3GGCTCCTGAGCCCATCe^^ 

DTYTCVVEHIGAPEPILRDWTPO 

640 650 660 ^ 670 ^ 630 ^ 690 ^ 700 

SSSPACCTCXOC^C^^^ 

710 720 730 740 7S0 ^ 760 ^ 770 



rrCTTGGTGTGATCAGCTGGCGGAGAGCTGGCCACTC 



:CTAGTTACACrCCTCTTCCrGGGTCCAATrATTC 



^GAACCACACTAGrCGACCGCCXCTCOACCGCTG^^^^ 



S L G 



790 



AGAAGGATGGCACAnTCCrAG 
TCTTCCTACCGTGTAAAGGATC 
2 G W H I S *> 
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ATGGCrrTCTGGGTGGGTCCCCTGGGTGCrrGGCTCTGCTAGTGAATCrrGACCC^^ 
M G S GWVPWVVALLVNLTQLDSSM> 



100 110 



120 130 



CTCAlGGCACAGACTCTCCAGAAGATTrTGTGATTCAGGCAAAGGCTGACTGTT^ 
S^S^S^GAGAGGTCTTCTAAAACACXAAGTCCGrrXCCGACTGAC^ 
TQGTDSPEDFVIQAKADCYFTW^^ 

150 ISO 170 180 ^ 190 ^ 200 ^ 210 

AGAA;.aOT;CAG™™™ 



220 230 .240 ^ 250 ^ 260 ^ 270 ^ 280 

GGGATG-rTTGTGGclTTGACCAAGCTGGG^CAGC^GATGCTGAGCAGTGG^ 

230 300 310 320 ^ 330 ^ 340 ^ 350 

TGOA^AGGA^CAGA^GGCCGrGGlrGGGGTCTG^AGAcIcAAC^ACA^ 

ACCTCTCCTCGTCTGTCCGGCACCTACCCCAGACATCTGTGTTGATGTCCGACCCGCGTGGGAAGTGACA 
LERSRQAVDGVCRHNYRLGAP- 



c^<^Igaaaagxgcaaccagaggtgacagtgtacccagagaggaccccactc^^ 

CCCC-CTTTTCACGTTGGTCTCCACTGTCACATGGGTCTCTCCTGGGGTGAGGACGTGGTCGTAT.^^ 
G^RKVQPEVTVYPERTPLLHQHNi. 



C-GcIcTGCTCTGTGACAGGCTTCTATCCAGGGGATATCAAGATCAAGTGGT^^^^ 

SSgacgagacactgtccgaagataggtcccctatagttctagttcacca^^^ 

LHCSVTGFYPGDIKIKWFLN^^y 
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AGAGAGCTCKKMTCATGTCCACTGGCCCTATOVGGAATCKAGACTGGACCTTTC^ 

TCTCTCGACCCCAGTAai<MTGACCGGGATAGTCCnn:ACCTCTGACCTGGAAAGTCTGACACCACTACGA 
ERAGVMSTGPIRKGDWTFQTVVML> 

570 580 590 600 610 620 630 

AGAAATGACTCCTGAACTTGGAa^TGTCTACACCTGCCrrGTCGATC».CTCCAGCCTGCTGAGCC 
TCTTTACTGAGGACTTGAACCTGTACAGATGTGGACGGAACAGCTAGTGAGGTCGGACGACTCGGGACAA 

e'mtpelghvytclvdhs SLLSPV> 



640 6S0 660 



S70 680 690 700 



TCTGTGGAGTGGAGAGCTCACn-CTGAATATTCTrGGAGAAAGATGCTGAGTGGCATTGCAGCCrTCC^^ 
AGACACCrCACCTCTCGAGT<»GACTTATAAGAACCrCTrTCTACGACrCACCGTAACGTCGGA^ 
SVEMRAQSEYSWRKMLS GIAAFL> 

710 720 730 740 750 7S0 770 

TTGGGCTAATCTTCCTTCTGGTGGGAATCGTCATCCAGCTAAGGGCTa«3AAAGGATATGTGAGGACGCA 
AACCCGATTAGAAGGAAGACCACCCrTAGCAGTAGGTCGATTCCCGAGTCTTTCCTATACACrCCTGCGT 
LGLIFLLVGIVIQI,RAQKGYVRTQ> 

780 790 800 810 820 

GATGTCTGGTAATGAGGTCTCAAGAGCTGTTCTGCTCCCrCAGTCATGCTAA 
CTACAGACCATTACTCCAGAG'rrCTCGACAAGACGAGGGAGTCRGTACGATT 
M SGNEVSRAVLLPQSC*> 
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J^TGrcrGGGGGTCCAGGAGTCCrCCAAGCTCTGCCTGCCACCATCTTCCrCCTCTC^ 
MPGGPGVLQALPATIFLLFLLSA> 



100 110 



120 



TCTACCTGGGCCCTGGGTGCCAGGCCCTGTGGATGCACAAGGTCCCAGCATCATTGATGGI^ 

ISJ^SSgggacccacggtccgggacacctacgtgttccagggtcgtagt^^^ 

VYLGPGCQALWMHKVE 



ASLMVSLG> 
190 200 210 



ISO ISO 170 130 

c^^vcgcccact^ccaatgcccgcacaItagcIgcaa^cgccaa^^ 

CCTTCTGCGGGTGAAGGrrACGGGCGTGTTATCGTCGTTGTTGCGGxxGw.oxv^^^^ 

EDAHFCCPHNSSNNANVTWWRVl.> 



240 250 



HGNYTW?PSFLGPGEDPNGTLII> 
290 300 310 320 ^ 330 ^ 340 ^ 350 

agaa^gtgaIcaagIgccatgggggcatatacgtgtgccgggtccaggagg^ 

^S?ACACXTGTTCTCGGTACCCCCGTATATGCACACGGCCCAGGTCCTCCCGTTGCT^^^ 
QNVNKSKGGIYVCRVQEGNESYUU 

360 370 380 ^ 390 400 ^ 410 ^ 420 

tgcgccIgccgccccccaggcccttcctggacatgggggagggcacc 
:ggggggtccgggaaggacctgtaccccctcccgtgg 



L D M G E G T> 



GTCCTGCGGCACCTACCTCCGCG' 
CAGGACGCCGTGGATGGAGGCGCACGCGGTCGGO 

SCGTYLRVRQ PPPR 

430 440 450 450 

AAGAlcCGAlTCATkcAGCCGAGGGGATkTCcicCTG^CTGCGCGGTGGTGC^^^ 
^^S^SSJAGTAGTGTCGGCTCCCCrrAGTAGGAGGACAAGACGCGCCACCACGG^^^ 
KNRIITAEGIILLFCAVVPGT 



470 
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500 5X0 520 ^ 530 ^ 540 ^ 550 ^ 560 

ISIS?S??GCrACGGTCTTGCTCTTCGAGCCCAACCTACGGCCCCTAC^AT^^ 
L?RKRWQNEKLGLDAGDKYEDENL> 

570 530 590 600 ^ 610 ^ 620 ^ 630 

TTATLu«3GCCTGAlcCTG^ACGA^TGCTCCATGiATGA^ACATCTCCCG^^ 
SSS^fcScTTGGACCTGCTGACGAGGTACATACTCCTGTAGA^^^^ 

YEGLNLDDCSMYEDlSRGLgi-i 

650 660 ^ 670 ^ 680 ^ 690 ^ 700 

C;«^TGTG;;CAG;CTCAlcATA^i.TCclGCTGkGAA^CCGT^^ 
GrSACACCCGTCGGAGTTGTATCCTCTACAGGTCGACCrCrTCGGCACTGTGG^^ 



Q D V G S L. 



KIGDVQLEKP 
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GAATTCCGCG GTGACC ATG GCC AGG CTG GCG TTG TCT CCT GTG CCC AGC 
Mac Ala Arg Leu Ala Leu Ser Pro Val Pro Ser 
5 10 

CAC TGG ATG GTG GCG TTG CTG CTG CTG CTC TCA GCT GAG CCA GTA CCA 
Kis T^ Met val Ala Leu Leu Leu .Leu Leu Ser Ala Glu Pro Val Pro 
IS 20 25 

GCA GCC AGA TCG GAG GAC CGG TAC CGG AAT CCC AAA GGT AGT GCT TGT 
Ala Ala Arg Ser Glu Asp Arg Tyr Arg Asn Pro Lys Gly Ser Ala Cys 
30 35 40 

TCG CGG ATC TGG CAG AGC CCA CGT TTC ATA GCC AGG AAA CGG CGC TTC 
ser Arg He Trt, Gin Ser Pro Arg Phe He Ala Arg Lys Arg Arg Phe 
45 SO 55 

T^/^ r:rr rrr GGC AAT GTG AGC 

?S vll «ic M^t ser Gly A.n val Ser 

60 70 ^ = 

TGG C^C TGG AAG CAG GAG ATG GAC GAG AAT CCC CAG CAG CTG AAG CTG 
S Leu Tro Lys Gin Glu Met Asp Glu Asn Pro Gin Gin Leu Lys Leu 
80 as 90 

GAA AAG GGC CGC ATG GAA GAG TCC CAG AAC GAA TCT CTC GCC ACC CTC 
Ss Gly Arg Met Glu Glu Ser Gin Asn Glu Ser Leu Ala Thr Leu 
95 100 ^05 

ACC ATC CAA GGC ATC CGG TTT GAG GAC AAT GGC ATC TAC TTC TGC CAG 
Thr 11 e Gin Gly He Arg Phe Glu Asp Asn Gly He Tyr Phe Cys Gin 
110 HS :l20 

CAG AAG TGC AAC AAC ACC TCG GAG GTC TAC CAG GGC TGC GGC ACA GAG 
Gin Lvs cys Asn Asn Thr Ser Glu Val Tyr Gin Gly Cys Gly Thr Glu 
125 130 135 

CTG CGA GTC ATG GGA TTC AGC ACC TTG GCA CAG CTG AAG CAG AGG AAC 
Su Arg val Met Gly Phe Ser Thr Leu Ala Gin Leu Lys Gin Arg Asn 
140 150 

ACG CTG AAG GAT GGT ATC ATC ATG ATC CAG ACG CTG CTG ATC ATC CTC 
Thr Leu Lys Asp Gly He He Met He Gin Thr Leu Leu He He Leu 
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TTC ATC ATC GTG OCT ATC TTC CTG CTG CTG GAC AAG GAT GAC AGC AAG 
ill lie val Pro He Phe Leu Leu Leu Asp Lys Asp Ser Lys 



GCT GGC ATG GAG GAA GAT CAC ACC TAG GAG GGC CTG GAC ATT GAC CAG 
S ^ Glu Glu ASP His Thr Tyr Glu Gly Leu Asp He Asp Gin 
190 

ACA GCC ACC TAT GAG GAC ATA GTG ACG CTG CGG ^CA GGG GAA CTG AAG 
Thr Tyr Glu Asp He Val Thr Leu Arg Thr Gly Glu Val Lys 
,05 210 21S 



TGG TCT GTA GGT GAG CAC CCA GGC CAG GAG TGAGAGCCAG GTCGCCCCAT 
TrT) Ser Val Gly Glu His Pro Gly Gin Glu 

22S 230 



220 
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^^^^.-TTV'TY'Tf-TTr^a^MTCGCTCaGTMTGCGCGAGCRAAATTTAAGCTACA 



220 



240 250 



290 300 310 320 330 340 ^ 350 



570 S80 590 600 ^ 610 ^ "0 ^ "0 

ACCCTGARAGGATGAACCGTCATGTAGATGCATAATCAGTAGCGATAATGGTACCACTACGCCAiU^ 
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720 730 



740 7S0 760 770 



TGGGAGTTTGTTTTCGCACCAAAATCAACGGGACTITCC^ 



820 



840 



ACG(7raGGAGGTCTATATAAGCAC5AGCTCrCTCWCrJUVCrrJ«3ACAAC^ 



lTTCGTCTCGAGAGACCGATTGATCTCTTGGGT 
890 900 910 



CAAATGGGCGGTAGGCX3TGTJ 

GTTTACCCGCCATCCGCACATGCCACCCTCCAGATATAl 
850 860 

CTGciTACT^GCTxlTCGAlArrAlTACGACTCACTATAGGGAGACCCUUICTGGCT^ 
SSiScCGAATAGCTrrAATTATGCTGACTGATATCCCTCTGGGTTCC»CCGATCTC^^ 

920 930 940 9S0 ^ 960 ^ 970 ^ 980 

* . -.. „..^r~,>TV7/~^^nr~\-< f l' l LTT GGCTTGGGGTCTAT 

990 1000 1010 1020 1030 ^ 1040 ^ 1050 

ACACCCCCG^TTCCTCATG^ATA^TGATGGTATAGCTTAGCCTATAGGTGTGGGTTAWG^^ 
^S^CGAAGGAGTACAATATCCACTACCATATCGAATCGGATATCOVCACCCAATAACTGGTAATA 

lOSO 1070 1080 1090 1100 ^ 1110 ^ 1120 

TGAckcTCCCCTA^GTkcGATACTTTCCATTACTAATCCATAACATGGCTCTTTCCC^ 
ISSJS^GGATAACCACTGCTATGAAAGGTAATGATTAGGTArrGTACCGAGAAACGGTGTTGAGAG 

1130 1140 1150 1160 1170 ^ 1180 ^ 1190 

-Jl^„, o^r-^^^-^r-Rf-sr arrnarRCGGACTCTGTATTTTTACAGGATGGGG 



1200 1210 



1240 12S0 



TCTclTTTATTATTTACAAATrCACATATACAACACCACCGTCCCCAGTGCCCGCAGTTTTTA 

IS^™Staaatgtttaagtgtatatgttgtggtggcaggck.tcacgggcgtcaaaaataatttot 

1270 1280 1290 1300 ^ 1310 ^ 1320 ^ 1330 

taacgtgggatctccacgcgaatctcgggtacgtgttccggacatgggctcttctccggtagcggcgg^ 
I?^gSSSI^IStgcgct7Agagcccatgcacaaggcctgtacccgagaagaggccatcgc« 



1340 13S0 1360 1370 

:acatccgagccctg( 
gaagatgtaggctcgggac< 



1390 1400 



cttctacatccgagccctgctcccatgcctccagcgactcatggtcgctcggcagctccttgct^ 

SIS?^SS?GGGACGAGGGTACGGAGGTCGCTGAGTACCAGCGAGCCGTCGAGGAACGAGGATTG 
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^^^^^^^^^^ 

,0 1590 1600 1610 



IS SO 



ISSO 1570 



, i„0 



* ^, * ^orTTT-rcaTCMGTCrrTTCTGCAGGCCAGCCGGCCT 



■AGAATGACACCTACTCAGACAATGCGAT 
.TCTTACTGTGGATGAGTCTGTTACGCTA 



ATCCTCCCtCrrGCTGTCCTGCCCCACCCCACCCCCCAGAATi 
TAGGAGGGGGAACGACAGGACGGGGTGGGGTGGGGGGTCTTA1 



lACCTTCCAGGGTCAAGGAAGGCACGGGGG 

CGTTAAAGGACTAAAATAATCCTTTCCTGTCACCC 
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GGCAimrrCTATGGCCGACTCTAGATTTTCTCCrrGCCM 

cS?ISSS?Sggctgagatctaaaagaggaacgcc«^ 

2iaO 2190 2200 2210 2220 2230 ^ 2240 

2250 2260 2270 2280 2290- 2300 ^2310 



* * -^-.^,,^A-rr-BaiiTrrTCCTCGTTTTTGGAAACTGAC 



JAGGGAGTACTCACCCCAACAGCTGGCCCTCGCAGACA 



2630 

■SI^GCAGCTACCCCGCCTCAACAATGCTGTAARA 



TATAkG;icCTCCCACCGTACACGCCTACCGCCCArrTGCGTCAATGGGGCGGAGTTGTT^^ 

atatatctggagggtggcatgtgcggatggcgggt; 



cccg;gagt;aaac;gcta;ccac;cccattgatgtacxgcca^ 

GGGCACTCAffTTTGGCGATAGGTGCGGGTAACrACAravCGGT^^ 
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2810 2820 2830 2840 2850 2860 2870 

GACTAATACGTAGATGTACTGCCJUVGTAGGAAAGTCCCATAAGGTCATCTACTGGGCATJU^TGCC^ 
CTGATTATGCATCTACATGACGCrrrCATCCTTTCRGGGTATTCCAGTACATGACCC^ 

2880 2390 2900 2910 2920 2930 ^ 2940 

(MCXaTTrACCGTCATTGACGTCAATAGGGGGCGTACTTGGCATATGATACACITGATCTACro 
CCGGTAAATGGCAGTAACTGCAGTTATCCCCCGCATGAACCGTATACTATGTGAACTACATGAC^^ 



r^CAGTrTACCGTAAATAGTCCACCCArrGACGTCflATGGAAAGTCCCTATTGGCGTTACTATGG GAAC 
CCCGTCAAATGGCATTTATCAGGTGGGTAACTGCAGTTACCTTTCAG<KiATAACCGCAAT^ 

302O 3030 3040 3050 3060 3070 ^ 3080 

^TACCTCATTATTGACGTCAATGGGCGGGGGTCGTTGGGCGGTCaGCCAGGCGGOT 
TATGCAGTAATAACTGCAGTTAC 



TATGTAACGCGGAACTCCATATATGGGCTATGAACTAATGACCCCGTAATTGATTACrATTAATAACTAG 
ATACATTGCGCCTTGAGGTA-ArACCCGATACTTGATTACTGGGGCytTTAACTAATGATAATTAT^ 

31S0 3170 3180 3190 3200 3210 ^ 3220 

TCAATAATCAATGTCCTGCATTAATGAATCGGCCAACGCGCGGGGAGAGGCGGTTTGCGTATTGGGCGCT 
AGTTATTAGTTACAGGACGTAATTACTTAGCCGGTTGCGCGCCCCTCTCCGCCAAACGCATAACCCGCGA 

3230 3240 3250 3260 3270 3280 ^ 3290 

CTTCCGCTTCCTCGCrCACTGACrCGCTGCGCTCGGTCGTTCGGCTGCGGCGAGCGGTATCAGCTCACTC 
GAAGGCGAAGGAGCGAGTGACTGAGCGACGCGAGCCAGCAAGCCGACGCCGCTCGCCATAGTCGAGTGAG 

3300 3310 3320 3330 3340 33S0 ^ 33S0 

AAAGGCGGTAATACGGTTATCCACAGAATCAGGGGATAACGCAGGAAAGAACATGTGAGCAA^^ 
TTTCCGCCArrATGCCAATAGGTGTCnAGTCCCCTATTGCGTCCTTTCTTGTACACTCGTTTTCC^^^ 

3370 3380 3390 3400 3410 ^ 3420 ^ 3430 

CAAAMGCCAGGAACCGTAAAAAGGCCGCGTTGCTGGCGTTTTTCavTAGGCTCOKZCCCCCT^^ 
GTTTTC<KGTCCTTGGCATTrrTCCGGCGCAACGACCGCAAAAAGGTATCCGAGG^ 

3440 3450 3460 3470 3480 ^ 3490 ^ 3500 

ATCAOUUVAATCGACGCTCAAGTCAGAGGTGGCGAAACCOlACAGGACTATAAAGATACCAGGCGm 
TAGTGTTTrrAGCTGCGAGrrCAGTCTCCACCGCTTTGGGCTGTCCTGATArrrCTATG^ 
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CCCTGGAAGCrCCCTCGTGCGCTCTCCTGrrCCGACCCTGCCGCrrACC<K3ATACCT^ 
GGGACCrrCGAGGGAGCACGCGAGAGGACAAGGCTCKGACCWCGAATGGCCTATGGACA^ 



CCTTCGGGAAGCGTGGCGCTTTCTCAATGCrCACGCTGTAGGTATCTCAGTTCGGTGTAGGTCGTT 
GGAAGCCCTTCGOVCCGCGAAAGAGTTACGAGTGCGACATCCATAGAGTCAAGCaWyVTCCAGC^^ 



3720 3730 3740 3750 



TGAGTCCAACeCGGTAAGACACGACTTATCGCCACrKKSCAGCAGCWCTGGTAA^^ 
ACTCAGGTTGGGCCa^TTCTGTGCTGAAXAGCGGTGACCGTCGTCGGTGACCATTGTCCTAATCGTCTCGC 



AGGTATGTAGGCGGTGCTACAGAGTTCTTGAAGTGGTGGCCTAACTACGGCTACACTAGAAGGACAGTAT 
TCCATACa.TCCGCCACGATGTCTOVAGAACTTCACCACCGGATTGATGCCGATGTGATCTTCCTGTCATA 



TTGGTATCTGCGCTCTGCTGAAGCCAGTTACCTTCGGAAAAAGAGTrGGTAGCTCTTGATCCGGCAAACA 
AACCATAGACGCGAGACGACTTCGGTOUlTGGAAGCCTTTTTCTCAACCATCGAGAACrAGGCCGTT^ 



3930 3940 39S0 3960 



AACCACCGCTGGTAGCGGTGGTTTTTTrGTTTGCAAGCAGCAGArrACGCGCAGAAAAAAAGGATCT 
TTGGTGGCGACCATCGCCACCAAAAAAACAAACGTTCGTCGTCTAATGCGCGTCTTTTrTTCCTAGAGTT 



GAAGATC<iTTTGATCTTrTCTACGGGGTCTGACGCTCAGTGGAACGAAAACrCACGTTAAGGGATr^ 
CnrCrAGGAAACTAGAAAAGATGCCCCAGACTGCGAGTCACcrrGCTTTTGAGTGCAATrCCCTA^ 



,070 4080 4090 4100 



TCATGAACAATAAAACrGrCTGCTTACATAAACAGTAATACAAGGGGTGTTATGAGCCATATrCAACGGG 
AGTACTTGrrATTTTGACAGACGAATGTATTTGTCATTATGTTCCCCACAATACTCGGTATAAGTTGCCC 



AAACGTCTTGCrCGAGGCCGCGATTAAArrCCAACATGGATGCTGATrTATATGGGTATAAATGGGC^ 
TTrG»GAACGAGCrCCGGCGCTAATTTAAGGTTGTACCTACGACTAAATATACCCATATTTACCCGAGC 
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SIwSAGCCCCrrTACrrCCACGCTGTTAGATAGCXAACATACCCT^ 

4280 4290 4300 4310 4320 ^ 4330 ^ 4340 

rTGAAACAT«3aU^AGGTAGCCrrrGCCAAT<»TC^ 
SCTTTS7.CC(rrTTCCATCGCAACGGTTACTACAATOTCTAC^^ 



AArrTATGCCTCTTCCGACCATCAAGCATTTTATCCCrrACTCCTtaTGATGCAT^ 

ttSatacggagaaggctggtagttcgtaaaataggcatgacgactact^^ 



CTS^^^^^TAAGGTCCATAATCTTCTTATAGGACTAAGTCa.CTTTTATAAC^^ 



CTGGCACrrGTTCCTCCGCCGGTTGCATTCGATTCCTGTrrGTAATTGTCCTTTTAACM^ 
SSCTcISAGGACGCGGCCAAaSTAAGCTAAGGACAAACATTAACAGGAAAATTGTCGCT^ 



rrCGicrCGCTCAGGCGCAATCACGAATGAATAACGGTTTGGTTGATGCGAGTGATTTTG^ 

SSSSgcgagtccgcgttagtgcttacttattgccaaaccaactacgctcactaaaactactgctcgc 



TAATGGCrGGCCrGTTGAACAAGTCTGGAAAGAAATGCATAAACTTTTGCCATTCTCACCGGAT^^ 
IJ^AcScGGACAACTTGTTCAGACmcrrTrACGTATrTGAAAACGGTAAGAG^^ 



GTCACTCfli^GTGATTTCTCACTTGATAACCTTATTTTTGACGAGGGGAAATTAATAGGCT^^ 
S^SIcCACTAAAGAGTGAACTATTGGAATAAAAACTGCTCCCCTTTAATTATCCAACATAACTM 



tccttcattacagaarcggctttttcaaaaatatggt; 



:attgataatcctgatatgaataaattgcagttt 



AGGAAGTAATGTCTTTGCCGAAAAAGTrrrTATACCATJ 



:AACTATTAGGACTATACTTATrrAACGTCAAA 
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4910 4920 4930 4940 49S0 4960 4970 

a^TTT(»TGCrCGATGACrrTTTrCTAATCRGAATTGOT 

CTAAACTACGAGCTACTCaAAAAGATTAGTCITAACCAATTAACCJUVCATTG^ 

4980 4990 5000 SOlO S020 S030 S040 

GCGGATROlTATTTGAATCrrArrrftGAAAAATAAACAAATAGGGGTTCCGCGCaCATTTCCCC GAAA^ 
CGCCTATGTATAAACTTACATAAATCTTTTTATTTGTTTATCCCCaACKXXSCGT^ 

5050 

GCCACCTGACGTC 
aSGTGGACTGCAG 
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SATCGOIGCGGTGGTACCCrrACGTCCACGTCTAGGTCTCGGACAAAGACGAG^ 

MGMQVQIQSLFLI.I.LWVP> 

80 90 100 110 120 ^ 130 ^ 140 

r^rAGAci^CACACCCrGTGGAAGGCCGGAATCCrGTATAAGGCXAAGTTCGTGGCTG 
CCAMTCTCCTGTGTGGGACACCTTCa3GCCTTAGGA^ 

GSRGHTLWKAGII.YKAKFVAAWTL> 



GAAG^CTGCCGCTTTCCTGCCTAGCGATTTCTTTCCTAGCGTGAAGCTGACCCCACTGTGCGTGACCC^ 
S^^^ScGAAAGGACGGATCGCrAAAGAAAGOATCGCACTTCGACTC 
KAAArL?SI>FFPSVK ^ 

220 230 240 250 260 ^ 270 ^ 280 

tatatggatL^cgtggtgctgggagccagcatcatcaacttcgagaagctgggactgtcca^^^ 
SSaSLtgcaccacgaccctcggtcgtagtagttgaagctcttcgaccctga^^ 



YMDDVVLGASIINFE 



ctaggctgatcctgaaggagcctgtgcacggcgtgtccaccctgccagagaccaccgtggtgaggagg^^ 

SSS?TAGGACTTCCTCGGACACGTGCCGCACAGGTGGGACGGTCTCrGGT^^^ 
ARLrLKE?VHGVSTLPETTVVRRT> 

360 370 380 



CGTGTACTATGGAGTGCCTGTGTGGAAGTGGCTGAGCCTGCTGGTGCCCTTTGTGGGTACC 
GCACATGATACCTCACGGACACACCTTCACCGACTCGGACGACCACGGGAAACACCCATGG 
VYYGVPVWKWLSLI.VP FVGT> 
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GCTAGCGCCGCCACCATG<KAATGCAGGTGCAGATCCACaGCCTGTTTCn:X3CTCCTC 
CGATCGCGGCGGTC3GTACCC?TACGTCCACCrrCrWMTCTCCK3ACAAAGACGAGGAGGACRCC 

MGMQVOIQSLFLLLI'WVP> 

80 90 100 110 120 ^ 130 ^ 140 

GGTCCAGAGGACACACCCTGTGGAAGGCCGGAATCCTGTATAAGGCCAAGTTC^^ 
CCaGGTCTCCTGTGTGGGACACCTTCCGGCCTTAGGACATArrCCGGTTCAAGCA^ 

GSRGHTLWKAGILYKAKFVAAWTL* 
ISO 160 170 130 190 ^ 200 ^ 210 

SJSSSSIISI^StcSSISSSItcSSSSggggtgacacgcac^ 

KAAAFLPSDFFPSVKLTPI.CVTI.> 
220 230 240 250 260 ^ 270 ^ 280 

tatatcgatgacgtggtgctgggagtgggactgtccaggtacgtggctaggctgatcctgaaggagcctg 

ATATACCrACTGCACCACGACCCTCACCCTGACAGGTCCATGCACCGATCCGACTAGGACTTCCTCGGAC 
YMDDVVLGVGLSRYVARI<It'KEP> 

290 300 310 320 330 ^ 340 ^ 350 

TGCA^GGCGTGTCcIcCCTGCCAGAGACCACCGTGGXGAGGAGGACCGTGTACTATGGAGTCC^^^ 
ACGTGCCGCACAGGTGGGACGGTCTCTGGTGGCACCACTCCTCCTGGCACATGATACCTCACGGACACA^ 
VKGV STLPETTVVRRTVYYGVPVW, 

360 370 380 390 

GAAGTGGCTGAGCCTGCTGGTGCCCTTTGTGTGAGGTACC 
CTTCACCGACTCGGACGACCACGGGAAACACACTCCATGG 
KWLSLLVPFV-> 
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